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Figure 1 

Restriction enzyme analysis of CPN100686 (RY 54 - SEQ ID NO. l) 

Alwl 
Fnu4HI 
Taul 
Acil 
MspAlI 
Dpnl | 
Hpyl78III Cjel| j 
Acil Mnll |Sau3AI | | | 



I I 



TspRI 
Btsl I 



Cjel CjePI 
Sspl CviRI | 

I I 



ATGGACTTCCGCATATTGTCAGGAGGGGATCAGCGGCACTGCTAATGGACAATATTCTGC 



■+ 60 



TACCTGAAGGCGTATAACAGTCCTCCCCTAGTCGCCGTGACGATTACCTGTTATAAGACG 



Taal 
BsaJI | 
BstDSI j 
Cjel || Bed 
I II I 



Fokl 
Sfcl 
Cvi JI I 



Taal 

Cjel CjePI | 



CviJI 
BsmFI | 
FHU4HI I j 
Tsell | | 



Bbvl 
Dral 
Msel | 



AAACCGTGGATGGCGTATGGCTGTAGTGATTGACGGTTATATGGTCAGCAGCCCTATTTT 



61 



-+ 120 



TTTGG CAC CTAC CG C AT ACCGACATCACTAACTGC CAATATACCAG TCGTCGGGATAAAA 



Maell 



Apol 
Tsp509I 
BsrI BsmAI | 

MslI Ddel | | 
NlaIII| TspRI | | BseMII Taal 
II III II 



AAACGTCCCATTGAAAAATCATGCCAGTGTCTCAGGGAAATTTACCCACCGTGAAGTGAG 

121 + + + + + + 180 

TTTGCAGGGTAACTTTTTAGTACGGTCACAGAGTCCCTTTAAATGGGTGGCACTTCACTC 



Hpyl78III 
BseMII | 
Mnll 
Dral | 
Haeivj 
Hpyl88IX Hin4I ] 
Ddel | Msel | j 
I I III 

CAAACTCGCCTCAGATTTAAAATCTGGAGCGATGTCTTTTGTTCCCGAGGTTCTCAGTGA 

181 + + + + + + 240 

GTTTGAGCGGAGTCTAAATTTTAGACCTCGCTACAGAAAACAAGGGCTCCAAGAGTCACT 



BsmAI 
BsmBI 
Earl 
Ddel 
Sthl32I 
Bpml 
BsaJI 
Hpyl78III 
Aval | 
Mnll | | 
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gure 1 (Continued) 

Dpnl 
Earl 
Sau3AI 
Hpyl88IX 
MboII 
Dpnl 
BseMII | 
Hin4 1 1 I 
Sau3AIi I 
MboII I I 1 
TspRI I lit 
II lit 



BplI 
I 



Rsal 
BsrGI I 
TatI I 
I I 



Ddel 
I 



Nlalll 
Nspl 
SphI 
Cac8i I 
I I 



241 



AGAGACGATCTC.TTCTGATCTTGGGAAAAAACAATGTACACAAGGCATTATCTCAGCATG 

_ i | . ^ — ■ — _ .4. _ — — — — H — r 

TCTCTGCTAGAGAAGACTAGAACCCTTTTTTGTTACATGTGTTCCGTAATAGAGTCGTAC 



300 



BseMII 
CviJI 



BsrDI 



Mnll 
Hgal ] 



Acelll 
Sthl32I 
Hin4I I 
BsaHI I I 



301 



! I M HI 

CTGTGGCTTGGCAATGCTTATTGTTTTGATGAGCGTATATTATAGATTTGGAGGCGTCAT 

+ + + — , + + 

GACACCGAACCGTTACGAATAACAAAACTACTCGCATATAATATCTAAACCTCCGCAGTA 



Acelll 
Bbvl 
Taal 
SfaNI 
Sfcl 



360 



Alul 
CviJI 

MboII I Hinfl 

Mwol I Tfil 

Hpyl78III| ! Hpyl88IX I 

lit II 



Alul 
CviJI 
Fnu4HI I 
Tsell I 
Cjel I I I 
I 1 I I 



CGCTTCGGGAGCTGTTCTTCTGAATCTTTTGCTTATCTGGGCAGCTCTACAGTATTTGGA 



361 



-+ 420 



GCGAAGCCCTCGACAAGAAGACTTAGAAAACGAATAGACCCGTCGAGATGTCATAAACCT 



Hhal 
HphI I 



I 



Hinfl 
Hpyl76III 
Plel I 
Cjel I I 
rokl I ! I 
I I I 



Beef I 
I 



CviJI 
Haelll 
Bed 
Eael 
Gdill 
SfaNI I 



Mwol 



4 21 



TGCGCCACTCACCTTGTCAGGACTCGCTGGGATTGTTCTTGCTATGGGGATGGCCGTAGA 
ACGCGGTGAGTGGAACAGTCCTGAGCGACCCTAACAAGAACGATACCCCTACCGGCATC? 



'4 80 
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Figure 1 (continued) 



CviRi 



Hpyi88ix 
Mnll | 
NspV Hinflj j Apol 



BsmAI Msel 



Fokl | TaqI Tfil| |Tsp509I 

II I II I I II 

GCAAATGTTCTTGTATTCGAAAGAATCCGAGAGGAATTTTTATTGTCTCAAAGTCTTAA 



481 



540 



ACGTTTACAAGAAC^TAAGCTTTCTTAGGCTCTCCTTAAAAATAACAGAGTTTCAGAATT 

CviJI CviJI 
BsaJI | NlaIV| Hinfl 
Sfcl Styl j Mwol | j Tfil Sfcl 

I I I I II I I 

AAAATCTGTAGAAAAAGG ATATACCAAGG CTTTTGGAGC CATTTTTG ATTCTAACTTGAC 



541 



-+ 600 



601 



TTTTAGACATCTTTTTCCTATATGGTTCCGAAAACCTCGGTAAAAACTAAGATTGAACTG 

BbvCI 
BpulOI 

Ddel CviJI 
CviJI | BseMII Haelll BslI 

Hael | Mnll | EcoO109I |EcoNI | 

Taal Haelll j MboII | j Bfal Sau96I | Msel j 

I II I I I I I I I I 

TACAGTATTGGCCTCAGCACTTCTTTTCTTCCTAGATACAGGGCCTATTAAAGGGTTTGC 

1- + h h + + 

ATGTCATAACCGGAGTCGTGAAGAAAAGAAGGATCTATGTCCCGGATAATTTCCCAAACG 



Apol 
Tsp509I 

MboII 
Beef I 

Apol Nlalll | 

MboII Hpyl78III | j 

Tsp509I Earl CviJI Real | j j 



660 



Earl 
I 

TTTGACATTGATTTTAGGAATTTTCTCTTCAATGTTTACGGCTCTTTTCATGACTAAATT 



661 



-+ 720 



AAACTGTAACTAAAATCCTTAAAAGAGAAGTTACAAATGCCGAGAAAAGTACTGATTTAA 



Ndel 

Fokl CviRI | 

Nlalll Siral | Taal | j XmnI 

I II III I 

TTTCTTCATGCTGTGGATGAATAAGACCCAACATACACAGTTGCATATGATGAATAAGTT 

721 + + + + + + 780 

AAAGAAGTACGACACCTACTTATTCTGGGTTGTATGTGTCAACGTATACTACTTATTCAA 
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Hpyl78III 
Smll 
Mnll | 
SfaNI | 
Nlalll | j 



Hpyl78III 
CviJI | 
Bce83I | | 
Fokl| | .| 



781 



CviRI 
I 

CGTGGGGATAAAGCATGATTTCTTGAGAGGATGCAAAAAACTTTGGGCTGTTTCTGGAAG 



-+ 840 



GCACCCCTATTTCGTACTAAAGAACTCTCCTACGTTTTTTGAAACCCGACAAAGACCTTC 



Apol 
EcoRI 
TspSOSI 
ScrFI 
CviJI | 
EcoRII j 
MlaIV| j 
II I 



Sthl32I 

f 



Aval 



I 



841 



TGTTTTTCTTTTAGGTTGCGTTGCTCTCGGGTTTGGAGCCTGG^^^^COTXTTGGGAAT 
+ + + +"^» + + 

ACAAAAAGAAAATCCAACGCAACGAGAGCCCAAACCTCGGACCTTAAGGCAAAACCCTTA 



900 



Dral 
Msel| 
Mnll | | 



Msel 



Nlalll 



SfaNI 



I 



GGATTTTAAAGGAGGGTATGCCTTTACCTTTAATCCAAAAGAGCATGGCATCAGCGATGT 
901 + + + + + + 96Q 

CCTAAAATTTCCTCCCATACGGAAATGGAAATTAGGTTTTCTCGTACCGTAGTCGCTACA 



CviRI 



Sfcl 



MboII 

Alul| 
CviJI 



Hpyl78III 
Bfal | 
Xbal ) j 
BsmAI I 



I 



TG CTCAAATG CGTGG CAAAGTTGTG C ATAAACTACAGGAAG C TG GTCTTTCTTCTAGAGA 

961 + + + + + + 1020 

ACGAGTTTACGCACCGTTTCAACACGTATTTGATGTCCTTCGACCAGAAAGAAGATCTCT 

BsaBI 
Dpnl 
Sau3AI 
Alwl 
Hpyl8 8IX | 
Tthlllll | 
Dpnl | | 

BstYI | || Ml Alul 
Sau3AI j || III CviJI 
Eco57l MboII || jj III Hindi II 
I I I I II 
CTTCCGTATTCAAACATTTGGATCTTCAGAAAAGATCAAAATCTATTTTAGTGATAAAGC 
1021 + + + + + + 1080 

GAAGGCATAAGTTTGTAAACCTAGAAGTCTTTTCTAGTTTTAGATAAAATCACTATTTCG 
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Figure 1 (continued) 



Cac8i 
RleAi 
Alul 
CviJi 
Nlalll 
Hpyl78III 
Real 
BplI 



Alul 
CviJi 
Msel | 
I I 



Ddel 
I 



Hin4I 
CviJi | 

• I I 



Dpnl 
Sau3AI 
Msel 
AceIIl| 
Tsp509I | j 
Mnll | | j 

II II 



TTTAAG CTATACTAAG CAGATACG AG CCTCTCTC CTAAAATTAACGATCATGAG CTGG CG 

1081 + + + + + + H40 

AAATTCGATATGATTCGTCTATGCTCGGAGAGAGGATTTTAATTGCTAGTACTCGACCGC 



Hpyl8 8IX 



Bfal 
CviJi 
Hael 
Haeiri 
StuI 



I 



TTATTGTGGGATTGTTGTCAGAAACAGGCCTAGATTTCTCTACGGAAACTCTAAACGAAA 

1141 + + + + + + 1200 

AATTACACCCTAACAACAGTCTTTGTCCGGATCTAAAGAGATGCCTTTGAGATTTGCTTT 



Bcgl 

Apol Fnu4HI Bbvl 

Tsp509I Tselj TaqI | 

I II II ' I 

CGCAAAATTTTGGTCAAAGGTAAGCAGCAAACTATCGAAGAAAATGCGTTATCAGGCGAC 



Sthl32I 
MboII | Bcgl 



1201 



-+ 1260 



GCGTTTTAAAACCAGTTTCCATTCGTCGTTTGATAGCTTCTTTTACGCAATAGTCCGCTG 



Alul 

Bed CviJi CviJi Hhal 

ill i 

CATCGGGCTTTTAGGAGCTTTGGCAATCATCTTGCTCTATGTGAGTTTGCGCTTTGAATG 

1261 + + + + + + 1320 

GTAGCCCGAAAATCCTCGAAACCGTTAGTAGAACGAGATACACTCAAACGCGAAACTTAC 
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CviRI 
CviJI Mwol | 



f=3 

TIB- 

Li J 



fn 



Nlalll 
Hpyl78III 
Tsp509I 
Msel | 

TspRI Hhal | j 
Beef I Mwol | Mwol | j| Real 

I i I I I II I 

GCAATATGCTTTCAGTGCCGTATGCGCTTTAATTCATGACCTTTTGGCTACCTGTGCAGT 

1321 + + + + + + 1380 

CGTTATACGAAAGTCACGGCATACGCGAAATTAAGTACTGGAAAACCGATGGACACGTCA 

CviJI 

Apol CacBI | 

BsgX Tsp509I MboII CviRI | | Mwol 



I II I I I I 

CTTGTTTATAG CACATTTCTTTTTGAAG AAAATTC AAATAG ATTTG CAAG CCATTGGTG C 



1381 



-+ 1440 



GAACAAATATCGTGTAAAGAAAAACTTCTTTTAAGTTTATCTAAACGTTCGGTAACCACG 

Dpnl 

Bell | Dpnl 
Msel Taal Msel Sau3AI j Sau3AI |Hpyl78III 

II I Mill 

TTTAATGACTGTATTGGGGTATTCATTAAACAATACTTTGATCATTTTTGATCGTATTCG 

1441 + + + + + + isoo 

AAATTACTGACATAACCCCATAAGTAATTTGTTATGAAACTAGTAAAAACTAGCATAAGC 



SfaNI 
Nlalll 
Nspl 

Dpnl Nsil | 

Sau3AI | MboII CviRI | j | Msel 

III III 
TGAAGATCGCCAAGCGAACCTGTTTACCCCTATGCATGTTTTAGTTAATGATGCCCTTCA 

1501 + + + + + + 1560 

ACTTCTAGCGGTTCGCTTGGACAAA.TGGGGATACGTACAAAATCAATTACTACGGGAAGT 



Acil 
Fnu4HI 

Taul MslI Alul 
Maell CviJI | Taal | CviJI Msel 

I II II I I 

AAAGACGTTCAGCCGCACGGTAATGACAACAGCTACAACTCTATCAGTTTTGTTAATGCT 

1561 + + + + + + 1620 

TTTCTGCAAGTCGGCGTGCCATTACTGTTGTCGATGTTGAGATAGTCAAAACAATTACGA 



Nlaiv 
CviJI 
Fnu4HI | 
Taul j 
BseRI Acil | j 
I II I 



Mnll 
Tsp509I 



CjePI Hinfl 
| MboII Tfil 

I I I 

TTTGTTTATAGGCGGCTCCTCTGTCTTTAATTTTGCATTTATTATGACCATAGGGATTCT 

1621 + + + + + + 1680 

AAACAAATATCCGCCGAGGAGACAGAAATTAAAACGTAAATAATACTGGTATCCCTAAGA 



Msel | CviRI 
I I 
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BsmAI Avail 
Bfal CjePI BsmBI CviRI Mnll Sau96I 

i ill ii 

TCTAGGAACTTTATCGTCTCTTTATATTGCACCACCTCTGTTGTTGTTTATGGTCCGTAA 



1681 



AGATCCTTGAAATAGCAGAGAAATATAACGTGGTGGAGACAACAACAAATACCAGGCATT 

Msel 

Taal | AflHI 



1740 



Msel 



Mae 1 1 



Rsal | | 

III I I 

AGAAAATCGCTCAAAATAAGTACCGTTAAACTTAATCTAACGTGTAGCAATATAAAAATC 



1741 



■+ 1800 



BsmFI 

i 



TCTTTTAG CG AGTTTTATTCATGGCAATTTGAATTAGATTG CACATCGTTATATTTTTAG 



NlalV 
Cvi JI 
Haelll 
EcoO109I | 
Sau96I | 
PshAI BsmFI 1 



Apol 
Tsp509I 
Msel | 

I I 



Hpyl8 8IX 
Apol | 
Tsp509I | 



TCCTTTGGGACTTTAGTCCCAAAGGCCCCTGTGGTATTAAATTTATGACAAATTCAGATA 



1801 



-+ 1860 



AGGAAACCCTGAAATCAGGGTTTCCGGGGACACCATAATTTAAATACTGTTTAAGTCTAT 

ATGC 

1861 1864 

TACG 
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Figure 2 

Restriction enzyme analysis of CPN100696 (RY 55 - SEQ ID NO . 2) 



Dral Bed 
MsellCviJI I 



Msel 

Apol Pad 

Tsp509I Vspl 

Msel |Tsp509l I 

I TspSOSI | | Msel | | Bfal 

II I I I I I I! I | 

TTATTTTAAAAGCCCATCTTTTTAGGTATGTAATTAAAATTTTTAATTAATGTTTTCCTA 
1 + „ + + + + + 

AATAAAATTTTCGGGTAGAAAAATCCATACATTAATTTTAAAAATTAATTACAAAAGGAT 

Fokl BscGI 
Maelll BspMI Bfal Taal | Mnll I 

' I III. |f 

GTGTAACCTGCTTCTTTAGGAACTACACTAGGAGAACGGTATGTCATCAAATCTACATCC 



60 



61 



CACATTGGACGAAGAAATCCTTGATGTGATCCTCTTGCCATACAGTAGTTTAGATGTAGG 

Bbvl 
Hinf I 
Ddel 
Hpyl78III 
Fnu4HI 
Alul| 
CviJT j 
MspAlI j 
PvuII j 
Tselj 
BseMII | | 

Sthl32I Fnu4HI j j 

Bsll| Bbvl Tsel| jj | | Plel Mnll 

II I MM II II 



120 



121 



CGTAGGAGGAACAGGAACAGGAGCAGCTGCTCCTGAGTCTGTGCTAAACATAGTAGAGGA 

+ + H H 1 + 

GCATCCTCCTTGTCCTTGTCCTCGTCGACGAGGACTCAGACACGATTTGTATCATCTCCT 



180 



Maelll 
Tsp45I 
Bbvl | 
SfaNI | MspAlI 



Xcml 
ScrFI 
BspGI 
EcoRII 
Tthlllll 
BspGI 
Cjel | 
Maell 



Fnu4HI 
Tsel | 

Sthl32I | | Hpnl | | Acil | AccI Tsp509I iBsrl 
Ml I I I I I I III 

AATAGCAGCATCGGGGAGTGTCACCGCTGGTCTACAAGCAATTACGTCCAGTCCAGGAAT 
181 + + + + + + 24Q 

TTATCGTCGTAGCCCCTCACAGTGGCGACCAGATGTTCGTTAATGCAGGTCAGGTCCTTA 
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Hinfl Bed 
Tfil HphI Cjel | 



Apol 
Tsp509I 
Fokl | 
I I 



Hinfl 
Tfil 
BsaAI | 
Maell) | 
II I 



241 



GGTGAATCTACTCATAGGATGGGCAAAGACAAAATTTATTCAACCTATACGTGAATCAAA 

+ + + + - + 1- 

CCACTTAGATGAGTATCCTACCCGTTTCTGTTTTAAATAAGTTGGATATGCACTTAGTTT 



300 



Tsp509I 
Cac8I 
Alul | 

Alul CviJI j 

CviJI Hpyl78III | j 
I III 



Apol 
EcoRI 
CjePI Tsp509I 
I 



301 



GCTCTTTCAATCCAGAGCTTGCCAAATTACCCTGCTCGTTTTAGGAATTCTTTTGGTTGT 

-f + h h H + 

CGAGAAAGTTAGGTCTCGAACGGTTTAATGGGACGAGCAAAATCCTTAAGAAAACCAACA 

Cjel 
MboII 
Nlalll | 
CjePI | | 

Mwol| Nsplj 



360 



Cjel 
Nsilj 
CviRI | | Bbvl 



361 



BsrI 
CviJI | BslI 

I I I 

TGCTGGATTAGCATGTATGTTTATCTTCCATAGCCAGTTAGGGGCAAATGCATTTTGGTT 
+ + + + + j. 

ACGACCTAATCGTAC ATACAAATAGAAGGTATCGGTCAAT CCC CGTTTACGTAAAAC CAA 



420 



Maelll 

Fnu4HI Maelll Bfal | 

Tsel | Msel | Spel| | MslI Hindlll 

II I I II I I | 

GATTATTCCTG CTG CCATAGG ATTGATTAAGTTACTAGTTAC ATCATTATGTTTTGATG A 



421 



-+ 480 



CTAATAAGGACGACGGTAT C CTAACTAATT CAATGATCAATGTAGTAATACAAAACTACT 



Hpyl88IX 
RsaX 
BsrGI 



TatI 
Alul | 
CviJI j 

I I 



Nlalll BspMI BslI Aarl 
I I I I 



Dpnl 
Sau3AI I 



AGCTTGTACATCTGAAAAACTCATGGTTTTCCAAAAATGGGCAGGTGTTTTAGAAGATCA 
481 + + + + + + 5 4 o 

TCGAACATGTAGACTTTTTGAGTACCAAAAGGTTTTTACCCGTCCACAAAATCTTCTAGT 
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Figure 2 (continued) 



Dpnl 
Nlaiv 
BamHI 
BstYI 
Sau3AI 
Acelll 
Bed 
Alwl 
MboXI 



Taql 
Alul 
CviJI | | 
I I I 



Alwl 
Msel | 



Nlalll 
CviJI 
Hael 
Haelll 
MscI 
Eael I 



I I 



GCTCGATGATGGGATCCTTAATAACTCAAATAAGATTTTTGGCCATGTGAAAACAGAAGG 

541 + + + + + ■ + 

CG AG CTACTACCCTAGGAATTATTGAGTTTATTCTAAAAAC CGGTAC ACTTTTGTCTTC C 



600 



'S3 



Mnll 
CviJI | 
Bfal | I 



Msel 
Rsal 
Seal 
TatI | 
Bmrl | | 
BsrlM | 



SacII 
Acil 
MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 
Fnu4HI 
Taul 
CviJI 
Haelll 
Bed | 
Eael | 
Gdill | 



Rsal 
TatI | 
Hpfal| | 
II I 

AAATACCTCTAGGGCTACTACCCCAGTACTTAATGATGGCCGCGGAACTCCTGTACTTTC 

601 + + + + + + 660 

TTTATGGAGATCCCGATGATGGGGTCATGAATTACTACCGGCGCCTTGAGGACATGAAAG 



Thai 
CacSI | 
Alul | | 
CviJI I 



Tthlllll 
SfaNI | 
SfaNI | J 
Bfal | | | 



Fokl 
Mae I I | 

Mi ii ii i i 

ACCTTTAGTAAGTAAAATAGCTCGCGTTTAGACGTTCATCTCACAAGCATCCTAGAACTT 

661 + + + + + + 720 

TGGAAATCATTCATTTTATCGAGCGCAAATCTGCAAGTAGAGTGTTCGTAGGATCTTGAA 
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Hpyl88IX 
Dpnl 
Sau3AI 
Rsal 
BsaAI | 
SunI j 
Maell| j 
Fokl | j j 



Tsp509I 
Taal | 



III I I I I II 
GGGATGCTACTTTCCACGTACGAGATCAGATGTAAAGAGCAACAGTAATTATTTTCTACA 



721 



■+ 780 



CCCTACGATGAAAGGTGCATGCTCTAGTCTACATTTCTCGTTGTCATTAATAAAAGATGT 



TspRI 

Taal | Nlalll 

I I I 
CTGTTGTAATAAAATCATGT 

781 + + 800 

GACAACATTATTTTAGTACA 
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Figure 3 

Restriction enzyme analysis of CPN100709 (RY 57 - 

Dpnl 
Sau3AI 
Hpyl78III 
Alwl 
Hinf I 
Tfil 
Taal 
Bcgl 
Nsil 



SEQ ID NO . 3) 



Dpnl 
Sau3AI | 
Cac8I | | 

I I I 



CviRI 
Nlalll 
Nspl 



TaqI 



Acil 
I 



Rsal 
TatI | 



TGCTGGCAGATCGTTTCCACATGCATACTGTGAATCTCGATCCCTATGCGGAAAATGTAC 



-+ 60 



ACGACCGTCTAGCAAAGGTGTACGTATGACACTTAGAGCTAGGGATACGCCTTTTACATG 



61 



Apol 
EcoRI 

Bcgl Msel Bfal Tsp509I 

II II 

TTGTAAACTTAAAAACCATAGCGACGACTTTTTCTAGTTTATGACAATACGAATTCTTGC 

+ + + + + + 

AACATTTGAATTTTTGGTATCGCTGCTGAAAAAGATCAAATACTGTTATGCTTAAGAACG 

Alul 
CviJI 
Bfal 
CviJI 
Hael 
Haelll 
Stul 



120 



I 



ECOS7I 
Maelll | 



HaelV 
Hin4I 
NlalV | 
Avail | | 
Sau96l| j 



Nlalll 
TaqI I 
Hpyl78III | 
Real | j 
BsmFI | j j 
II 



121 



TGAAGGCCTAGCTTTCCGTTACGGAAGCAAGGGACCGAATATCATTCATGATGTTTCTTT 

+ *■ h + h + 

ACTTCCGGATCGAAAGGCAATGCCTTCGTTCCCTGGCTTATAGTAAGTACTACAAAGAAA 



180 



Mnll 

Hinfl Avail | 
Bed Tfil Sau961 j BslI 

I I I I I 

CTCTGTCTATGATGGCGACTTTATAGGAATCATAGGACCAAACGGAGGGGGGAAAAGCAC 
181 + + + + + + 24Q 

GAGACAGATACTACCGCTGAAATATCCTTAGTATCCTGGTTTGCCTCCCCCCTTTTCGTG 



SUBSTITUTE SHEET (RULE 26) 



WO 00/39158 



9/868987 



PCT/CA99/01230 



Figure 3 (continued) 



13/96 



Msel 
I 



Tsp509I 
Msel| 
II 



Cac8I 
CviJI | 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Hpyl8 8IX| 
BslI | | 
Alwl | j.j 



Alwl 



Bbsl 
XmnI | 



I I I 



241 



CTTAACGATGTTAATTTTGGGCTTGCTTACTCCTACATTCGGATCCTTGAAGACTTTCCC 



GAATTGCTACAATTAAAAC CCGAACGAATGAGGATGTAAG C CTAGG AACTTCTGAAAGGG 

SacII 
Acil 
MspAlI 
Thai 
Acil 
BsaJT 
BstDSI 
BsmI 
Faul 
Sthl32I | 
MboII | | 
I II 



-+ 300 



Dpnl 

Nlalll) NlalV 
Cjel Sau3AI | j CjePI | 

I I 



Alwl 



Cjel 



I I 



301 



TTCGCATTCCGCGGGGAAACAAACCCATTCCATGATCGGTTGGGTTCCCCAACATTTCTC 



-+ 360 



AAGCGTAAGGCGCCCCTTTGTTTGGGTAAGGTACTAGCCAACCCAAGGGGTTGTAAAGAG 



BsmAI 

Dpnl Hpyl78III MboII 

Sau3AI | CjePI Ddel BseMII Ddel |MnlI BseMII | 

II I I I I I I " I | 

TTATGATCCTTGTTTTCCTATCTCAGTAAAAGATGTTGTCCTCTCAGGAAGATTGTCTCA 



361 



AATACTAGGAACAAAAGGATAGAGTCATTTTCTACAACAGGAGAGTCCTTCTAACAGAGT 



-+ 420 



Dpnl 
Mmel 
Sau3AI 
Sfcl | | Dpnl 
Alulj j j BstYI | 

Cvi JI | j j Sau3AI j 

I I I II I I || 
ACTCTCCTGGCATGGAAAATATAAAAAGAAAGATTTTGAAGCTGTAGATCACGCTTTGGA 
421 + + + + + + 48Q 

TGAGAGGACCGTACCTTTTATATTTTTCTTTCTAAAACTTCGACATCTAGTGCGAAACCT 



ScrFI 
EcoRII | Nlalll 
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Alwl Hpyl88IX 



481 



Hpyl78III 
Ddel | 

TspRI Mnll | I BseMII 

BtsI " | Bed | | | Hin4I I 

I I II I I I I I | 

TCTTGTTGGACTTTCTGACACCACCACCACTGCTTTCGCCCATCTCTCAGGAGGACAAAT 

+ + + + + + 54Q 

AGAACAACCTGAAAGACTGTGGTGGTGGTGACGAAAGCGGGTAGAGAGTCCTCCTGTTTA 



CviJT 
BpulOI | 
Ddel | 
CviJI | | 



Hpyl78III 
Tsp509I 
Apol | 
Tsp509I | 
Mnll | Mselj 



541 



CviJT 



Rsal 
TatI | 

I I III 

CCAGCGTGTACTTCTGGCAAGAGCCTTAGCCTCCTACCCTGAAATTTTAATTCTTGATGA 

+ + + + + + 600 

GGTCGCACATGAAGACCGTTCTCGGAATCGGAGGATGGGACTTTAAAATTAAGAACTACT 

Tthlllll 
Hpyl78III 
Dpnl | 
Sau3AI | | 
Alwl I I I 



I 



Apol 
Tsp509I Msel 



Alul 
CviJI 
BciVI | 

I I 



601 



GCCGACGACAAACATTGATCCTGACAATCAACAAAGAATTTTAAGTATCCTAAAAAAGCT 

+ + + + + + 66Q 

CGGCTGCTGTTTGTAACTAGGACTGTTAGTTGTTTCTTAAAATTCATAGGATTTTTTCGA 

BsiHKAI 
Bspl286I 
BseSI 
CviRI 
ApaLI 
BsaAI I 



Dpnl 
Sau3AI 
Hpyl78III 

HphI 
MboII 
Maelll | 
BstXI | | 
MslI | | j 

III I 



661 



Maell 
Rsal | 
SunI | j 

Ta *I | | I I I j Msil | | | Ml | Tsp509I Msel 

CAACCGTACGTGCACCATTCTTATGGTAACTCACGATCTTCACCATACGACGAATTACTT 



GTTGGCATGCACGTGGTAAGAATACCATTGAGTGCTAGAAGTGGTATGCTGCTTAATGAA 



-+ 720 



Bcgl 
CviRI 



TaqI Msel 
I 



721 



TAATAAAGTTTTTTATATGAACAAAACTTTGCACTTCATTGGCAGACACTTCGACCTTAA 

+ + + + + + ?80 

ATTATTTCAAAAAATATACTTGTTTTGAAACGTGAAGTAACCGTCTGTGAAGCTGGAATT 
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BseRI 
Apol | 
Tsp509I I 
Hpyl78III j |NlaIII 



Tsp509I 
Bcgl| 

Fokl|| Hpyl78III j |NlaIII Mnll 

III I I I I I 

CAGACCAATTTTGTTGTCATCCTATAAAAATCAGGAATTTTCATGCTCTCCTCACTAATC 



781 



-+ 840 



GTCTGGTTAAAACAACAGTAGGATATTTTTAGTCCTTAAAAGTACGAGAGGAGTGATTAG 



RleAI 
Fnu4HI | BsaXI 
Taul j CviJI | 

Hinfl AcilJ j NlaIV| j 

Tfil Bfal | j | Mwol | j | 

I I II I I II I 

CGTGATTCATTTCCCCTTCTTATTTTACTTCCCACATTCCTAGCGGCATTAGGAGCCTCC 

841 + + + + + + goo 

GCACTAAGTAAAGGGGAAGAATAAAATGAAGGGTGTAAGGATCGCCGTAATCCTCGGAGG 



Fnu4HI 
Taul 
Acil 
Cac8I 
Mnll 
Alul | 
CviJI j 
Mwol | | 



NlalV 



Mae 1 1 



GTAGCTGGCGGCGTTATGGGAACCTATATCGTTGTAAAACGTATTGTTTC 

901 + + + + + 950 

CATCGACCGCCGCAATACCCTTGGATATAGCAACATTTTGCATAACAAAG 
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Figure 4 

Restriction enzyme analysis of CPN100710 (RY 58 - SEQ ID NO . 4) 

Dpnl 

Xmnl Sau3AI | 
Apol | Ddel | | Tsp509I 
Tsp509I |HphI | j |AciI Ddel Msel | 

I I I I I I I I || 
GAGAATTTTTTCCTAAGATCACCGCTTCTTAGGATATTCGTTCTTTATTAAAATTATGCC 
1 + - + + + + + 60 

CTCTTAAAAAAGGATTCTAGTGGCGAAGAATCCTATAAGCAAGAAATAATTTTAATACGG 

Nsil 

Dpnl CviRI | 

Sau3AI | Nlalll j 

II II 
CCAATAGAATAATAGATCATCTTATCAAACTGCTTTTGTCATGCATAAAGTAATAGTTTT 
6X + + + + + + 12Q 

GGTTATCTTATTATCTAGTAGAATAGTTTGACGAAAACAGTACGTATTTCATTATCAAAA 

Msel CviJi 

i i 

CATTTTCCTTACCCTATATTCGTTAAAAAGTTATGGGAATGATGTAATAGATAAGCCCCA 
121 + + + . + + + 180 

GTAAAAGGAATGGGATATAAGCAATTTTTCAATACCCTTACTACATTATCTATTCGGGGT 

Bsal 
BsmAI 

Apol Earl | 

Nlalll Tsp509I Bfal Tsp509I | j 

I II III 
TGTTCTTGTCAGTATCGCCCCCTATAAATTCCTAGTTGAACAAATTGCTGAAGAGACCTG 
181 + + + + + + 240 

ACAAGAACAGTCATAGCGGGGGATATTTAAGGATCAACTTGTTTAACGACTTCTCTGGAC 

Dpnl 
Sau3AI | 

Alwl | j BbvCI 

Hinfl | j j BseRI BpulOI 

MboII Eco57I Maelll Tfil | j j MslI | Ddel 

I I I II II II I 

TTTTGTCTATGCGATAGTTACGAATCACTATGATCCCCATACCTATGAACTTCCTCCTCA 

241 + + + - - + + + 300 

AAAACAGATACGCTATCAATGCTTAGTGATACTAGGGGTATGGATACTTGAAGGAGGAGT 

Maelll 
BseMIlj 

Mnll | | Bsal NlalV 
Mnll | || BsmAI Drdll| Mnll 

I I II I II I 
GCAAATCAAGGAGTTACGACAAGGAGACCTTTGGTTCCGTATAGGAGAGGCATTTGGAAA 
301 + + + + + + 350 

CGTTTAGTTCCTCAATGCTGTTCCTCTGGAAACCAAGGCATATCCTCTCCGTAAACCTTT 
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Figure 4 (continued) 

Dpnl 

CviRI CjePl| Hinfl 

Nlalll Sau3Al|| Tfil 
Nspl Taql | | | BsmAI | 

I I I I I I I 

AAACTTGTTAGAGAAACCTTACATGCAACAAGTCGATCTTTCCCAAAATGTCTCGCTGAT 

361 + + + + + + 420 

TTTGAACAATCTCTTTGGAATGTACGTTGTTCAGCTAGAAAGGGTTTTACAGAGCGACTA 

CviJI 

CviJI Pf 111081 Tagil] 

CjePI | Cjel | BslI Msel | | 

II II I I II 

TCAAGGAAAG CCTTGCTGTAATCAACATAC CACGAACTACGACACCCACACTTGGTTAAG 

421 + + + + + + 480 

AGTTCCTTTCGGAACGACATTAGTTGTATGGTGCTTGATGCTGTGGGTGTGAACCAATTC 

Msel Maelll 
RleAI Cjel | BsmAI Cjel | Msel 

I II I II I 
CCCTAAAAACCTTAAAGTCCAAGTGGAGACTATCGTTACCACTTTAAGTAAAAAATATCC 
481 + + +■ + + + 540 



HaelV 
Hin4I 

Hinfl 

Mnll 



Thai 
Bsbl | 
Cjel | 
Plel 



Avail 
Sau96I 
Alul | 
CviJI | Mnll 



541 



BsrDI 
I 

TCAACACGCGACTCTATATCAAAGCAATGGAGAGAAACTTCTGTTAGCTTTGGACCAACT 

+ + h (- h + 

AGTTGTGCGCTGAGATATAGTTTCGTTACCTCTCTTTGAAGACAATCGAAACCTGGTTGA 



600 



BsaJI 
BstDSI 

Apol Ncol 
Tsp509I Mnll Styl 

I I I 

CAATGAGGAAATTCTTACGATTACCTCCAAAGCGAAACAACGCCATATTTTAGTTTCCCA 

601 + + + + + + 660 

GTTACTCCTTTAAGAATGCTAATGGAGGTTTCGCTTTGTTGCGGTATAAAATCAAAGGGT 
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. ~5 



Beef I 
CviJI 
Xcml 
NlalVj 
NlaXII | | 
I I! 



BseMII 
Sfcl I 



Tsp509I Ddel 

II II 

TGGAGCCTTTGGGTATTTTTGCCGTGATTACAATTTCTCTCAGCACACTATAGAGAAAAG 



€61 



-+ 720 



ACCTCGGAAACCCATAAAAACGGCACTAATGTTAAAGAGAGTCGTGTGATATCTCTTTTC 



CviJI 
Nlalll | 



Hpyl78III 
Thai | 
Cac8I | Maelllj 
CviJI | j Tsp45l| 
Ml II 



MboII 
Rsal j 
Taal | | 
Tat I j j 
I II 



CAGTCATGTTGAGCCTTCTCCTAAAGATGTGGCTCGCGTATTTCGTGACATTGAACAGTA 



721 



-+ 780 



GTCAGTACAACTCGGAAGAGGATTTCTACACCGAGCGCATAAAGCACTGTAACTTGTCAT 



Apol 
Tsp509I 



Hpyl78III 

Hinfl | MboII Cac8I 

MboII Tfil Taql Hpyl78III Bbsl | Mwol I 

I I I I ! I I II 

CAAAATTTCTTCTGTGATTCTTCTCGAATACTCTGGAAGACGAAGTAGTGCTATGCTGGC 

781 + + + + + + 840 

GTTTTAAAGAAGACACTAAGAAGAGCTTATGAGACCTTCTGCTTCATCACGATACGACCG 



Dpnl 
Sau3AI 
Hpyl78III 
Alwl 
Hinfl 
Tfil 
Taal 
Bcgl 
Nsil 



Dpnl 
Sau3AI | 
I I 



CviRI | 

Nlalll | 

Nspl | 

I I 



Taql 

I 



Acil 



Bcgl 
Rsal | 
Tat I | | 
I I 



AGATCGTTTCCACATGCATACTGTGAATCTCGATCCCTATGCGGAAAATGTACTTGTAAA 
841 + + + + + + 900 

TCTAGCAAAGGTGTACGTATGACACTTAGAGCTAGGGATACGCCTTTTACATGAACATTT 



Msel 



Bfal 



Apol 
EcoRI 
Tsp509I 



CviJI 
Hael 
Haelll 
StuI 



I 



CTTAAAAAC CATAG CG ACGACTTTTTCTAGTTTATG AC AATACGAATTCTTG CTGAAGGC 
901 + + + + + + 960 

GAATTTTTGGTATCGCTGCTGAAAAAGATCAAATACTGTTATGCTTAAGAACGACTTCCG 
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Figure 4 (continued) 



Alul 
CviJI Eco57I 
Bfal |MaeIII | 
II II 



HaelV 
Hin4I 
NlalV | 
Avail | | 
Sau96I j | 
II I 



Nlalll 
Taqll | 
Hpyl78III | | 
Real | | | 
BsmFI | III 
II III 



CTAGCTTTCCGTTACGGAAGCAAGGGACCGAATATCATTCATGATGTTTCTTTCTCTGTC 

SSI + + + + + + 1020 

GATCGAAAGGCAATGCCTTCGTTCCCTGGCTTATAGTAAGTACTACAAAGAAAGAGACAG 



Hinf I 
Tfil 



Mnll 
Avail | 
Sau96I | 



f=5 



Bed Tfil Sau96I | BslI Msel 

I I 

TATGATGGCGACTTTATAGGAATCATAGGACCAAACGGAGGGGGGAAAAGCACCTTAACG 

1021 + + + + - + + 1080 

ATACTACCGCTGAAATATCCTTAGTATCCTGGTTTGCCTCCCCCCTTTTCGTGGAATTGC 

Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Hpyl8 8IX| 

Tsp509I Cac8I BslI | | 

Msel | CviJI | Alwl | || I Alwl 

II II I I II 

ATGTTAATTTTGGGCTTGCTTACTCCTACATTCGGATCCTTGAAGACTTTCCCTTCGCAT 

1081 + -f + + + + 1140 

TACAATTAAAACC CGAACGAATGAGGATGTAAGC CTAGG AACTTCTGAAAGGGAAGCGTA 



BsmI 

Bbsl Faul 
Xmnl |Sthl32l| 
| |MboII | | 
II III 



SacII 
Acil | 
MspAlI | 
Thai j 
Acil | | 
BsaJI j | 
BstDSI | j 

I II 

TCCGCGGGGAAACAAACCCATT 

1141 + + -- 1162 

AGGCGCCCCTTTGTTTGGGTAA 
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Figure 5 

Restriction enzyme analysis of CPN100711 (RY 59 - SEQ ID NO. 5) 

BslI 
Dpnl 
Sau3AI 
ScrFI 
Apal 
Banll 
Bspl286I 
BsaJI 
EcoRII 

Bmgl 
BseSI 
CviJI 
Haelll 
NlalV 

Sau96I | | | | | | Hinf I 

Sau96I | | I | | | I Mmel 
Cjel Mill! I I Alwl Cjel Mnll Tfil 

I MINI Ml I I I 

ACAATCACTATGGG CCCAGG ATCGGTTCTTTCCAACC ATAG CAAAGAAG C AG GAGG AATC 

1 + + + + + + 

TGTTAGTGATACCCGGGTCCTAGCCAAGAAAGGTTGGTATCGTTTCTTCGTCCTCCTTAG 

Xmnl CviRl Taal 

I l l 

G CTATAAACAATGTC ATCATTGATTTTAGTG AAATCGTTCCTACTAAAGATAATG CAACA 
-t (. h a h 

CGATATTTGTTACAGTAGTAACTAAAATCACTTTAGCAAGGATGATTTCTATTACGTTGT 



60 



61 



120 



Alul 

CviJI Hpyl78III 
BsaXI | Tsp509I TaqI | 

Mwol| | Msel |TaqII | | CviRI 

III II I II I 

GTAG CTC CACCCACTCTTAAATTAGTAT CG AGAACTAATG C AGAT AGTAAAG ATAAGATT 

121 + + + + + + 180 

CATCGAGGTGGGTGAGAATTTAATCATAGCTCTTGATTACGTCTATCATTTCTATTCTAA 
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Figure 5 (continued) 



Dpnl 
BstYI 
Sau3AI 
Earl 
Hpyl78III 
Hinf I 
Ppil 



Maelll I 
Taalj 
Tsp45I j 
AlwNI | j 
MboH j j 
Plel| || 
II II 



Bfal 
Xbal| 
Alwl | j 

I II 



Apol 
Tsp509I 



GATATTACAGGAACTGTGACTCTTCTAGATCCTAATGGCAACTTATATCAAAATTCTTAT 

181 -*- + + + + + 240 

CTATAATGTCCTTGACACTGAGAAGATCTAGGATTACCGTTGAATATAGTTTTAAGAATA 



MboII 
EcoRV 
HphI 
Bbsl | 
Thai | 
Acil | j 
I I I 



Maelll 

Tsp509I CviRI Mwol | 

I ill 



CTTGGTGAAGACCGCGATATCACTCTTTTCAATATAGACAATTCTGCAAGTGGGGCAGTT 

241 -+ + + + + + 300 

GAACCACTTCTGGCGCTATAGTGAGAAAAGTTATATCTGTTAAGACGTTCACCCCGTCAA 



HphI 
CviJI |MaeIII 
Mwol | |Tsp45I 
III I 



Apol 
Tsp509I 
BslI | 



Alul 
CviJI 



ScrFI 
ECORII | 

Nlaiv| j 

1 1 



AC AGCC ACGAATGTCAC C CTTCAAGGGAATTTAGG AG CTAAAAAAGGATATTTAGGAAC C 
301 + + + + + + 360 

TGTCGGTGCTTACAGTGGGAAGTTCCCTTAAATCCTCGATTTTTTCCTATAAATCCTTGG 
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Figure 5 (continued) 



Aval 
BsaJl 

Alwl 
Apol 
Tsp509I 
Sthl32l| 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl | 

Apol | || ((I || Tsp509I Avail 

Tsp509l| || I)) || Mnll | Sau96I Cjel 

I I I 

TGGAATTTGGATCCAAATTCCTCGGGTTCATUUUSlTTATTCTAAAATGGACCTTTGACAAA 

2S1 + + + + + + 420 

ACCTTAAACCTAGGTTTAAGGAGCCCAAGTTTTTAATAAGATTTTACCTGGAAACTGTTT 



CviJI 
Haelll 
BspMI 
Sau96I 

CacSI I I Cjel 
Hhal I I j Bfal | 

Fokl | III BsmAI || Cjel 

I I III III I 

TACCTGCGCTGGCCCTACATCCCTAGAGACAACCACTTCTACATCAACTCTATTTGGGGA 

421 + + --+- + + + 480 

ATGGACG CGAC CGGG ATGTAGGGATCTCTGTTGGTGAAGATGTAGTTGAGATAAAC C C CT 



Ddel 
Dpnl 
BstYI 
Sau3AI 
BsaJI 

Maelll Styl 
BsiHKAI Tsp45I Drdll | 

Bspl286I Cjel | Taal | j | | |AlwI |NspI CviRI 

I II I II I I I I I I I 

GCACAAAACTCTTTAGTGACTGTGAACCAAGGGATCTTAGGGAACATGTTGAACAATGCA 

481 + + + + + + 540 

CGTGTTTTGAGAAATCACTGACACTTGGTTCCCTAGAATCCCTTGTACAACTTGTTACGT 



Nlalll 
AflHI | 
BspLUllI j 
Alwl | Nspl 



Cjel 
Dpnl 
BstYI | 
Sau3AI j 
Alwl | | 

I I I 



Dpnl 
Cjel 
BstYI | 
Sau3AI j 
Sf cl | | 
CviJI CviJI | | | 



Bsu36I 
Ddel 
Alwl | 

I I 



541 



MboII 

I I 
AGGTTTGAAGATCCTGCTTTCAACAACTTCTGGGCTTCGGCTATAGGATCTTTCCTTAGG 

+ + + + + + 

TCCAAACTTCTAGGACG AAAGTTGTTGAAGAC C CGAAGCCGATATC CTAGAAAGG AATC C 



600 
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Hinf I 
Cjel| 
Hphlj 
Hpyl88IX| 
Plel 
Apol | 
Tsp509I j 
Hpyl78III | | 
Taql | | 
I I I 



Fnu4HI 
Cjel| 
MspAlI | 

Tselj 
Sf aNI | j 
Acil| | | 
I I I I 



CO 

cn 

. f\ 



Bbvl 
Nlalll | 
Mnll | |CviJI 

III I 

AAAGAAGTATCTCGAAATTCTGACTCATTCACCTATCATGGCAGAGGCTATACCGCTGCT 

601 + + + + + + 660 

TTTCTTCATAGAGCTTTAAGACTGAGTAAGTGGATAGTACCGTCTCCGATATGGCGACGA 

EC057I 

Bbvl | 

Acelll | | 

Mnll | j Fnu4HI 

Apol | j j Alul | 

Tsp509l|| | Cvi JI | Maelll 

Mwol Fokl III | Tsel | Tsp45I 

i i in I- M i 

GTGGATGCCAAACCTCGCCAAGAATTTATTTTAGGAGCTGCCTTCAGTCAGGTTTTTGGT 

661 + + + + + + 720 

CACCTACGGTTTGGAGCGGTTCTTAAATAAAATCCTCGACGGAAGTCAGTCCAAAAACCA 

Maelll 
Tsp45I 
BpulOI I 
Ddel j 
CviJl| j BseMII 
III I 



Hpyl88IX 
HphI | 
Hinf I | j Plel 
III I 



CACGCCGAGTCTGAATATCACCTTGACAACTATAAGCATAAAGGCTCAGGTCACTCTACA 

721 + + + + + + 780 

GTGCGGCTCAGACTTATAGTGGAACTGTTGATATTCGTATTTCCGAGTCCAGTGAGATGT 

CacSI CviJI 

MboII Haelll 

Tthlllll| Taal Bsal | 

SfaKI || Hin4l| BsmAlj 

I II II I I 
CAAGCATCTCTTTATGCTGGCAATATCTTCTATTTTCCTGCGATACGGTCTCGGCCTATT 
781 + + + + + + 840 

GTTCGTAGAGAAATACGACCGTTATAGAAGATAAAAGGACGCTATGCCAGAGCCGGATAA 

BsaJI BslI 
Styl PflMI CviRI Nlalll 

II II 
CTATTCCAAGGTGTGGCGACCTATGGTTATATGCAACATGACACCACAACCTACTATCCT 

841 + + + + + + 900 

GATAAGGTTCCACACCGCTGGATACCAATATACGTTGTACTGTGGTGTTGGATGATAGGA 
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BsrDI 
BsrI | 

Tthlllll | I Dpnl 
MboII Bmrl| | j Sau3AI | 

I Mil II 
TCTATTG AAGAAAAAAATATGG CAAACTGGGATAGCATTG CTTGGTTATTTG ATCTG CGT 
901 + + + + + + 96Q 

AGATAACTTCTTTTTTTATACCGTTTGACCCTATCGTAACGAACCAATAAACTAGACGCA 



Alwl 
Msel 
Dpnl | 

BstYI | | | Mnll 
Sau3AI j I I Sfcl | 

TspRI j j j Mnll | j CviJI . BseMU 

I I I I III I I 

TTCAGTGTGGATCTTAAAGAACCTCAACCTCACTCTACAGCAAGGCTTACCTTCTATACA 
961 + + + + + + 1020 

AAGTCACACCTAGAATTTCTTGGAGTTGGAGTGAGATGTCGTTCCGAATGGAAGATATGT 



Apol 
ECORI 
Tsp509I 
Bst217I 
Ddel 



Alul | 
AlwNI | 
CviJI | 

II 



AccI 



Dpnl 
Bglll 
BstYI 
Sau3AI 
Bfal 
HaelV 
Hin4I 
Dpnl 
Sau3AI 



Apol 
Tsp509I 
ScrFI | 
EcoRII I 



Alwl 
Bfal | 
Alul | j 
CviJI 



II I II I 

GAAGCTGAGTATACCAGAATTCGCCAGGAGAAATTCACAGAGCTAGACTATGATCCTAGA 

1021 + + + + + + 1080 

CTTCGACTCATATGGTCTTAAGCGGTCCTCTTTAAGTGTCTCGATCTGATACTAGGATCT 



Nlalll 
Nspl 
SphI 
Cac8I | 
CviRI | j 

I I I 



Tsp509I 
Ddel | 



BsrI 
Hinfl| AccI 
Tfilj Sfcl | 
I I 



TCTTTCTCTGCATGCTCTTATGGAAACTTAGCAATTCCTACTGGATTCTCTGTAGACGGA 
1081 + + + + + + H40 

AGAAAGAGACGTACGAGAATACCTTTGAATCGTTAAGGATGACCTAAGAGACATCTGCCT 
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Alul 
CviJI 



Bbvl 



Fnu4HI 

Alul | 
CyiJI | 
MspAlI | 
PvuII | 

Tsel j Rsal 

I 



Hinf I 
Tf il 



GCATTAGCTTGGCGTGAGATTATTCTATATAATAAAGTATCAGCTGCGTACCTCCCTGTG 



1141 



-+ 1200 



CGTAATCGAACCG CACTCTAATAAGATATATTATTTCATAGTCG ACG CATGGAGGGACAC 

Hpyl78III 
Ddel | 

Mnll | j BseMII Maell 

I I I I | 

ATTCTCAGGAATAATCCAAAAGCGACCTATGAAGTTCTCTCTACAAAAGAAAAGGGCAAC 



1201 



■+ 1260 



TAAGAGTCCTTATTAGGTTTTCGCTGGATACTTCAAGAGAGATGTTTTCTTTTCCCGTTG 



Bsgl 
HphI 
Apol 
Tsp509I 
Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul 
CviJI 
Hin4I 
Acelll 
Bbvl 
CviRI 
BstAPI 
Mwol 
Mnll 
BssSI | 
Alul | 

Acll CviJI j 

Maell Fnu4HI | j 

Hindi | Tsel | j | 

I I II II 
GTAGTCAACGTTCTCCCTACAAGAAACGCAGCTCGTGCAGAGGTGAGCTCTCAAATTTAT 
1261 + + + + + + X320 

CATCAGTTGCAAGAGGGATGTTCTTTGCGTCGAGCACGTCTCCACTCGAGAGTTTAAATA 
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Maelll BsrI 
BplI | BspGI | 

II II 



BstZ17I 
AccI | 
Sf aNI | I 
BsaAI | | I 
Maelll I I I 



Beef I 



1321 



CTTGGAAGTTACTGGACACTCTACGGCACGTATACTATTGATGCTTCAATGAATACTTTA 

+ + + + ,_ __ + + 13Qo 

G AACCTT CAATG ACCTGTGAG ATGCCGTG C ATATG ATAACTACGAAGTTACTTATGAAAT 



CviJI 
Hael 
Haelll 
MnlX 
Mscl 



Mspl 
BsaWI 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
BslI I 



CviRI Eael | Alwl | 



Alwl 



Msel 
Tsp509I | 
BstZ17I | | 
Bfal Accl! I I 



1381 



I M I I I III I I II I I 

GTGCAAATGGCCAACGGAGGGATCCGGTTTGTATTCTAGGGTATACAATTAAAGATTTTA 

+ + + + + + 1440 

C ACG TTTAC CGGTTGC CTCC CTAGG C CAAACATAAG ATCC CATATGTTAATTTCTAAAAT 



NspV 

TaqI Taal 
Hinfl | BsaJI | 
Tfil BstDSI I 



Hindi 
Hpal 
Msel 
Thai 
Afllll 
Mlul 
Rsal 



CjePI 



BciVI 
Tsp509I | 
Mnll | j 

mi i i 

TGAAATTGAGGATACGGAGAGAGTGGGATTCGAACCCACGGTACGCGTTAACGCACACAC 
1441 + 

ACTTTAACTCCTATGCCTCTCTCACCCTAAGCTTGGGTGCCATGCGCAATTGCGTGTGTG 



+ 1500 



CjePI 
Hpyl8 8IX 
CviJI 
Mwol 



Mwol 



Msel 
Aflll 
Smll 
BsiHKAI 
Bspl286I 
Cac8I | 

I I 
I I 



1501 



GCTTTCCAAGCGTGCTCCTTAAGCCACTCGGACATCTCTCCATATTTATA 



-+ 1550 



CGAAAGGTTCGCACGAGGAATTCGGTGAGCCTGTAGAGAGGTATAAATAT 
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Figure 6 

Restriction enzyme analysis of CPN100877 (RY 61 - SEQ ID NO. 6) 

Cac8I 
Mwol | 

Maelll CviJl| 
Tsp45I Apol BsiHKAI | | 

Msel | Tsp509I Bspl286I j j 

II I I II 
AATTCTTTTTAAGTGACAAGAAATTCTTGTGCTCGGCTTGCTTTCTTATTCTTATTGACG 
! + + + + + + 60 

TTAAGAAAAATT C ACTGTTCTTTAAGAACACGAGCCGAACGAAAGAATAAGAATAACTG C 

Hpyl8 8IX 
Dpnl | 

Bell | | Hin4I 
Sau3AI | | Rsal Mwol Acil 

III I II 
-Jf TATTG CTTG ATC AGATATTCATTTTG ATTTAGGTACTAAAATGCGATTTT CGCT CTG CGG 
\Q si + + + + + + 12 o 

ATAACGAACTAGTCTATAAGTAAAACTAAATC CATG ATTTTACG CTAAAAG CGAGACGC C 

L; = 

CO Ddel 

£Q MboII j BseMII 

:~ Bfal Mnll BsrDI | j TaqI | 

I I I I I I I 

ATTTCCTCTAGTTTTTTCTTTTACATTGCTCTCAGTCTTCGACACTTCTTTGAGTGCTAC 

121 + + + + + ■+* 180 

TAAAGGAGATCAAAAAAGAAAATGTAACGAGAGTCAGAAGCTGTGAAGAAACTCACGATG 



Acll 
Mae 1 1 

Nlalll BsmI | 

Pflll08I Msel MboII | Hpyl88IX CviRI | | 

I I II I I I I 

TACGATTTCTTTAACCCCAGAAGATAGTTTTCATGGAGATAGTCAGAATGCAGAACGTTC 

181 + + + + + + 240 

ATGCTAAAGAAATTGGGGTCTTCTATCAAAAGTACCTCTATCAGTCTTACGTCTTGCAAG 

Fokl Bcgl 
Alul CviJI | HphI TaqI 

CviJI Sfcl | | Bsrl BsmAl| Maell | 

I III I II I I 

TTATAATGTTCAAGCTGGGGATGTCTATAGCCTTACTGGTGATGTCTCAATATCTAACGT 

241 + + + + + + 300 

AATATTACAAGTTCGACCCCTACAGATATCGGAATGACCACTACAGAGTTATAGATTGCA 
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Msel 
CviRI | 
I I 



Cac8I 
Bcgl| 
Cvi JI | j 
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BseMII 
Hpyl78III Maell 
Bsu36I |MaeIII | 
Maelll | j Mnll j 

Tsp45I Ddel |Tsp45I j 



I 



CGATAACTCTGCATTAAATAAAGCCTGCTTCAATGTGACCTCAGGAAGTGTGACGTTCGC 



301 



-+ 360 



GCTATTGAGACGTAATTTATTTCGGACGAAGTTACACTGGAGTCCTTCACACTGCAAGCG 



CQ 
m 

m 



H 



Hpyl7 8III 

Bsu36I | BseMII 
Nlalll Msel Sspl Ddel j Mnll | CviJI 

I I I II II I 

AGGAAATCATCATGGGTTATATTTTAATAATATTTCCTCAGGAACTACAAAGGAAGGGGC 



361 



-+ 420 



TCCTTTAGTAGTACCCAATATAAAATTATTATAAAGGAGTCCTTGATGTTTCCTTCCCCG 



Tthlllll 
Maell | 
Mnll | j Bcefl 



Smll 
Dpnl 

Bce83I BstYI | 

Rsal | Sau3AI | 

TatI | j Alwl | j 

III I II 

TGTACTTTGTTGCCAAGATCCTCAAGCAACGGCACGTTTTTCTGGGTTCTCCACGCTCTC 

421 + + + + + + 480 

AC ATGAAACAACGGTTCTAGGAGTTCGTTGC CGTG CAAAAAG AC C CAAG AGGTG CGAGAG 

Msel 
Sthl32I 
Mspl 
Neil 
ScrFI 
Banll| 
Bspl286I | 
BsaJl| j 

CviJI ||j || FoJcI 

Hpyl8 8IX | | | | | | BsmAI | CviRI 

I I I I I 

TTTTATTCAGAGCCCCGGAGATATTAAAGAACAGGGATGTCTCTATTCAAAAAATGCACT 
481 + + + + + + 540 



Tsp509I Ecil 
Msel | Acil| 

II II 
TATGCTCTTAAACAATTATGTAGTGCGTTTTGAACAAAACCAAAGTAAGACTAAAGGCGG 

541 + + + + + + 600 

ATACGAGAATTTGTTAATACATCACGCAAAACTTGTTTTGGTTTCATTCTGATTTCCGCC 



SUBSTITUTE SHEET (RULE 26) 



WO 00/39158 



09/868987 

PCT/CA99/01230 



Figure 6 (continued) 



29/96 



Hin4I 
Hinfl | 

Alul 

CviJI Maelll Sfcl Pflll08I | | 

I II III 

AGCTATTAGTGGGGCGAATGTTACTATAGTAGGCAACTACGATTCCGTCTCTTTCTATCA 



Hpyl8 8IX 
Tfil | BsmAI | 
BsmBI j 



601 



660 



TCGATAATC AC C CCG CTTACAATGATATC ATCCGTTGATGCTAAGG C AG AGAAAG ATAGT 



Mill I 
CviJI 
BsmI 



Fnu4HI | 
CviRI | j 
Tsel | j 
III 



BstoFI 
MboII 
Hin4I | 
Eco57I | | 
Bbvl | j | 

II I I 



NlalV 
Avail 
Eco0109I 
PspSII 
Sau96I 



Sfcl 



CviRI 
I 



GAATGCAGCCACTTTTGGAGGTGCTATCCATTCTTCAGGTCCCCTACAGATTGCAGTAAA 

661 + + + + + + 720 

CTTACGTCGGTGAAAACCTCCACGATAGGTAAGAAGTCCAGGGGATGTCTAACGTCATTT 



Rsal 

CjePI Hpyl78III Cjel | 

Cjel | Drdll | TatI | | 

CviRI | j Mnll| j CviJI ||| 

III II I I III 

TGAGGC AG AGATAAG ATTTGCACAAAATACTG CCAAGAATGGTTCTGGAGGGG CTTTGTA 

721 + + + + + + 780 

AGTCCGTCTCTATTCTAAACGTGTTTTATGACGGTTCTTACCAAGACCTCCCCGAAACAT 



Bed 
Bpml | 
Hpyl88IX [ j 
CjePI | | | 

I I II 



Hpyl8 8IX 

Dpnl 
Bell 
Sau3AI 
BsaBI | 
HphI | | 
III 



BsmI 



Mill I 
Hpyl78III | 
Taql | | 



CTCCGATGGTGATATTGATATTGATCAGAATGCTTATGTTCTATTTCGAGAAAATGAGGC 
781 + - + + - + + + 840 

GAGGCTAC CACTATAACTATAACTAGTCTTACGAATACAAGATAAAGCTCTTTTACTC CG 



ECOS7I 
Bbsl | 
MboII | 

CviJI | | Hpyl78III 
BsaXl| || BslI | 

Sfcl Mnll Hin4l| || Bpml | j TatI 

II II II III I 

ATTGACTACTGCTATAGGTAAGGGAGGGGCTGTCTGTTGTCTTCCCACTTCAGGAAGTAG 

841 + + + + + + 900 

TAACTGATGACGATATCCATTCCCTCCCCGACAGACAACAGAAGGGTGAAGTCCTTCATC 
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Figure 6 (continued) 



Bsri 
Rsai | 
Seal 



Maelll 
Tsp45I 



Taqll 
XmnI | 



Hpyl8 8IX Taal Cjel 

111 I IN 
TACTCCAGTTCCTATTGTGACTTTCTCTGACAATAAACAGTTAGTCTTTGAAAGAAACCA 
901 + + + + + + 960 

ATGAGGTCAAGGATAACACTGAAAGAGACTGTTATTTGTCAATCAGAAACTTTCTTTGGT 



Avail 
Eco0109I 
PspSII 
Sau96I 
Sse8647I 

CviJT Eco57I Earl 

NlaIV| Bfal | Hpyl78III 

Ecil || Cjel | j MboII Sf aNI | 

Acil | | | Mwol | | j Ddel | Mnll | | 

II II I I I I II I II 

TTCCATAATGGGTGGCGGAGCCATTTATGCTAGGAAACTTAGCATCTCTTCAGGAGGTCC 

961 + + + + + + 1020 

AAGGTATTACCCACCGCCTCGGTAAATACGATCCTTTGAATCGTAGAGAAGTCCTCCAGG 



1021 



Apol 
Tsp509I 

CviRI | Apol Alul 

Ndel | j Tsp5 09I CviJI 

III I I 
TACTCTATTTATCAATAATATATCATATGCAAATTCGCAAAATTTAGGTGGAGCTATTGC 
+ + + + + + 

ATGAGATAAATAGTTATTATATAGTATACGTTTAAGCGTTTTAAATCCACCTCGATAACG 

Dp ill 
Sau3AI | 

Hin4I | | BsaJI 
Mnll Bsri | | | Bpml Tsp509I Styl 

I I I II I I I 

CATTGATACTGGAGGGGAGATCAGTTTATCAGCAGAGAAAGGAACAATTACATTCCAAGG 



1080 



1081 



1140 



GTAACTArGACCTCCCCTCTAGTCAAATAGTCGTCTCTTTCCTTGTTAATGTAAGGTTCC 



Hpyl78III 
Apol | 
Tsp509I | 



Mspl Alul Taal SfaNI 
BsaWI| CviJI Fokl| Bed | 

II I II II 

AAAC CGGACGAG CTT AC CGTTTTTGAATGG CATCCATCTTTT AC AAAATG C TAAATTCCT 

1141 + + + + + + 1200 

TTTGGCCTGCTCGAATGGCAAAAACTTACCGTAGGTAGAAAATGTTTTACGATTTAAGGA 
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Tsp509I 
I 



BciVI 



I 



Dpnl 
Sau3AI 
Alwl 



Apol 
Tsp509I 
Sfcl | 
I I 



Hpyl8 8IX 
I 



GAAATTACAGGCGAGAAATGGATACTCTATAGAATTTTATGATCCTATTACTTCTGAAGC 

1201 + + + + + + 1260 

CTTTAATGTC CG CTCTTT ACCTATGAGATATCTTAAAATACTAGG ATAATGAAG ACTTCG 



EC057I 
Muni 
Tsp509I 
Accl | 
Siml | j 
Bed | | | 
I I I I 



Dpnl 
BstYI | 
Sau3AI | 
Alwl | j 



Rsal 
Tat I | 
I I 



NlalV 
Avail) 
Sau96I j 



AGATGGGTCTACCCAATTGAATATCAACGGAGATCCTAAAAATAAAGAGTACACAGGGAC 



1261 



■+ 1320 



TCTACCCAGATGGGTTAACTTATAGTTGCCTCTAGGATTTTTATTTCTCATGTGT CCCTG 



Bfal 
Avrll 
BsaJI 
Styl 
Dpnl 
Sau3AI | 
Bpml | j 
Plel Ml || Dral 
Alwl | j || | | | HaeIV| 
Hpyl78III Bfal || jj j || Hin4I| 

BsmFI | Hinf I | j j j j j | | Msel | 

II I I II II I I! II 

CATACTCTTTTCTGGAGAAAAGAGTCTAGCAAACGATCCTAGGGATTTTAAATCTACAAT 

1321 + + + + + + 1380 

GTATGAGAAAAGACCTCTTTTCTCAGATCGTTTGCTAGGATCCCTAAAATTTAGATGTTA 



PstI 
CviRI 
Sf CI 
BciVI 



BseMII 
Hindi 
Mnll 
Maell | 
Hpyl8 8IX | j 
Ddel | j | 

II II 



Maelll 
Tsp45I 
CviJT 
Haelll 
NlalV] 
Sau96I | j 



Msel 
Ddel Mnll | 
I I i 

CCCTCAGAACGTCAACCTGTCTGCAGGATACTTAGTTATTAAAGAGGGGGCCGAAGTCAC 

1381 + + + + + + 1440 

GGGAGTCTTGCAGTTGGACAGACGTCCTATGAATCAATAATTTCTCCCCCGGCTTCAGTG 
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Apol 
Tsp509I 
Taal Bpml | 

I i I 



Dpnl 
Sau3AI 



BsmAI 
ScrFI 
EcoRII | 
BsaXI | | 
I I I 



Drdll 
NlalVi 



Alwl 

I M 

AGTTTCAAAATTCACGCAGTCTCCAGGATCGCATTTAGTTTTAGATTTAGGAACCAAACT 
1441 + + + + + + 150Q 

. TCAAAGTTTTAAGTGCGTCAGAGGTCCTAGCGTAAATCAAAATCTAAATCCTTGGTTTGA 



in 

Ill 



f ~1 



1501 



Alul 
CviJI 
Msel 

Hpyl78III Aflll 
CviJI | smll 
Bbsl Hael j Alul | 

Ddel BsrDI |MboII Haelll Nrul CviJI | 

CviJI | Mnll | | Bed | StuI Thai Mnll Fokl || 

II I I I II II I I II 

G ATAGC CTCTAAGGAAG ACATTGCCATCACAGG CCTCGCGATAGATATAG ATAG CTTAAG 



- + 1560 



CTATCGGAGATTCCTTCTGTAACGGTAGTGTCCGGAGCGCTATCTATATCTATCGAATTC 

Alul 
AlwNI 
CviJI 
MspAlI 
PvuII 
Mnll j 

Fnu4Hl|| Bbvl Acil 

Tsel||i Msel | Mwol | Tthlllll 

mi ii ii i j ii 

CTC ATC CT C AAC AG C AG CTG TTATTAAAG C AAACAC CG C AAATAAAC AG ATATC C GTGAC 



Plel 
MaeIII| 
Tsp45I | 
EcoRV | j 



1561 



-+ 1620 



GAGTAGGAGTTGTCGTCGACAATAATTTCGTTTGTGGCGTTTATTTGTCTATAGGCACTG 



Apol 
Tsp509I 
MboII 
Hpyl8 8IX 
Ddel 



Sfd 



Hinf I 



BsrI BsrDI 



Dpnl 
Bglll 
BstYI 
Sau3AI 

I 
I 



GGACTCTATAGAACTTATCTCGCCTACTGGCAATGCCTATGAAGATCTCAGAATGAGAAA 
1621 + + + + + + i6 80 

CCTGAGATAT CTTGAATAG AG CGG ATGACCGTTACG GATACTTCTAGAGTCTTACTCTTT 
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Neil 
ScrFI 
BsaJI 
Mspl 
BslI 
CviJI 
NlalV 
ScrFI 
CviJI 
EcoRII 

Hin4I Sthl3 2I 

BseMII Maell | Mnll | 

I II- I I 

TTCACAGACGTTCCCTCTGCTCTCTTTAGAGCCTGGAGCCGGGGGTAGTGTGACTGTAAC 
1681 + + + + + + 1740 

AAGTGTCTGCAAGGGAGACGAGAGAAATCTCGGACCTCGGCCCCCATCACACTGACATTG 



Maelll 
Maelll Taal 
Tsp4 5I Bpml | 
II 



BsmFI 



I 



Mspl 
BsaWI | 
BsrFI j 
PinAI | 
II 



Bpml 



BslI 



Alul 
CviJI 
Bsp24I 
Cjel 
CjePI 
Tsp509I | 
Muni | | 

Tsp509I | | 

I I I 



TGCTGGAG ATTTC CTAC CGGTAAGTCCC C ATTATGGTTTTCAAGG CAATTGGAAATTAGC 

1741 + + + + + + 1800 

ACGACCTCTAAAGGATGGCCATTCAGGGGTAATACCAAAAGTTCCGTTAACCTTTAATCG 



1801 



Apol 
Cjel 
EcoRI 
Tsp5 0 9I 
CjePI | 

CjePI Bsp24l|| CjePI Bfal 

Mmel AlwNI BsrI |MboII ||j Tsp509I | CviJI | 

I I I I I III I I I I 
TTGGACAGGAACTGGAAACAAAGTTGGAGAATTCTTCTGGGATAAAATAAATTATAAGCC 
h + + + + + 

AACCTGTCCTTGACCTTTGTTTCAACCTCTTAAGAAGACCCTATTTTATTTAATATTCGG 



1860 



Alol 
Apol | 
Tsp509I j RleAI 
I I I 



HaelV 
BsmI Hin4I 
Sfcl| Alwl | 
II II 



TAGACCTGAAAAAGAAGGAAATTTAGTTCCTAATATCTTGTGGGGGAATGCTGTAGATGT 
1861 + + + + + + 1920 

ATCTGGACTTTTTCTTCCTTTAAATCAAGGATTATAGAACACCCCCTTACGACATCTACA 
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Figure 6 (continued) 



sfaNl 
Alul 
CviJi 
Taql 
Nsil | 
CviRl | I 

iMsel I 1 I I Nlalll | j 

I II 

CAGATCCTTAATGCAGGTTCAAGAGACCCATGCATCGAGCTTACAGACAGATCGAGGGCT 

1921 + + + + - + + 1980 

GTCTAGGAATTACGTCCAAGTTCTCTGGGTACGTAGCTCGAATGTCTGTCTAGCTCCCGA 



BspMI 
Dpnl 
BstYI | 
Sau3AI | 
Hpyl8 8IX | 
I I 



SimI 
Hpyl78III 

Bsal 
BsmAI 
CviRI | 
I I 



Taql 
Dpnl | 
Sau3AI | | 
Mnll | J | CviJI 
I I II I 



Apol 
Tsp509I 
MboII | 
Tsp509I | | 

Clal | j | MlalV 

Taql | | j Rsal 

Dpnl | Alwl j j | Bbsl BanI | 

Sau3AI | j Bed | | j |XmnI Nlalll Hpyl88IX Mnll |MboIl| j 

I II III I I I I I I I II I 

GTGGATCGATGGAATTGGGAATTTCTTCCATGTATCTGCCTCCGAAGACAATATAAGGTA 

1981 + + + + + + 2040 

CACCTAGCTACCTTAACCCTTAAAGAAGGTACATAGACGGAGGCTTCTGTTATATTCCAT 



Taal Acil Dpnl BpulOI 

Kpnl | MspAlI Sau3AI | Ddel 

II I III 

CCGTC ATAACAGCGG TGGATATGTTCTATCTGTAAATAATGAGATCACAC CTAAG C ACTA 

2041 + + + + + + 2100 

GG C AGTATTGT CG CCACCTATACAAG ATAG AC ATTTATTACTCTAGTGTGG ATTCGTGAT 

Bed 

Taql | BsmAI Acil 

II I I 

TACTTCGATGGCATTTTCCCAACTCTTTAGTAGAGACAAGGACTATGCGGTTTCCAACAA 

2101 + + + + + + 2160 

ATGAAGCTACCGTAAAAGGGTTGAGAAATCATCTCTGTTCCTGATACGCCAAAGGTTGTT 



BslI 
Bfal| 
Avrll | j XmnI 
BsaJI j j Sspl | 
Styl | | Mnll | | 
III I II 

CGAATACAGAATGTATTTAGGATCGTATCTCTATCAATATACAACCTCCCTAGGGAATAT 

2161 + 4- + + + + 2220 

GCTTATGTCTTACATAAATCCTAGCATAGAGATAGTTATATGTTGGAGGGATCCCTTATA 



Alwl 
Hin4I 
Dpnl | 
Sau3AI | | 
Mmel | j | 
I I I I 
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Hinf I 
Tf il 
Hpyl78III | 
Maell | j MboII 
Maelll Bce83I | | |Hpyl78IIl| 

Thai | Sthl32I |j If Smll || 

II I I I I I I M 

TTTCCGTTATGCTTCGCGTAACCCTAATGTAAACGTCGGGATTCTCTCAAGAAGGTTTCT 



2221 



- + 2280 



AAAGG CAAT ACGAAG CGC ATTGGGATTACATTTGC AGCCCTAAGAGAGTT CTTCCAAAGA 



Mnll 



Nlalll 
I 



2281 



TCAAAATCCTCTTATGATTTTTCATTTTTTGTGTGCTTATGGTCATGCCACCAATGATAT 



AGTTTTAGGAGAATACTAAAAAGTAAAAAACACACGAATACCAGTACGGTGGTTACTATA 



■+ 2340 



Apol 
Tsp509I 



HphI 
Alul| 
CviJI j 
MspAlI | 
PvuII j 
Cjel| | 



Muni 
Tsp509I 
I 



Sf cl 
CviJI | 

I I 



2341 



GAAAACAGACTACGCAAATTTCCCTATGGTGAAAAACAGCTGGAGAAACAATTGTTGGGC 



- + 2400 



CTTTTGTCTGATGCGTTTAAAGGGATACCACTTTTTGTCGACCTCTTTGTTAACAACCCG 



BanI 

Nlalll MboII 
Nspl BsaJI 
Mwol SphI Sty I 

Mnll Cjel |Cac8I | Bbsl | 

Bpml |AciI | |Hin4I | Mnll BplI Fokl | | 

I I I I I I I I I 111 

TATAGAGTGCGGAGGGAG CATG CCTCTATTGGTATTTGAGAACGGAAGACTTTTCCAAGG 



2401 



- + 2460 



ATATCTCACGCCTCCCTCGTACGGAGATAACCATAAACTCTTGCCTTCTGAAAAGGTTCC 



Bed 
Ml a IV | 



TspS09I 



Bsp24I 
Cjel 
CjePI 
BsmAI | 
Nlalll BsmBI j Bcefl 
I 



TGCCATCCCATTTATGAAACTACAATTAGTTTATGCTTATCATGGAGATTTCAAAGAGAC 
2461 + + + + + + 2520 

ACGGTAGGGTAAATACTTTGATGTTAATCAAATACGAATAGTACCTCTAAAGTTTCTCTG 
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CviJI 
Haelll 
Bed 
Eael 
Gdill 
PstI | 
CviRI | j 
Sfcl | | | 
I I I I 



Cjel 
CjePI | 
Bsp24l | j 

III 



Clal 
Msel TaqI 



Rsal Bfal 



GACTGCAGATGGCCGTAGATTTAGTAATGGGAGTTTAACATCGATTTCTGTACCTCTAGG 



2521 



-+ 2580 



CTGACGTCTACCGGCATCTAAATCATTACCCTCAAATTGTAGCTAAAGACATGGAGATCC 



2581 



Fokl 

Cac8I BseMII | 

Alul | Hpyl78III Rsal | j 

Mnll CviJI | Ddel |TatI | | | 

I II I I I I I I 

CATACGCTTTGAGAAGCTGGCACTTTCTCAGGATGTACTCTATGACTTTAGTTTCTCCTA 



GTATGCGAAACTCTTCGACCGTGAAAGAGTCCTACATGAGATACTGAAATCAAAGAGGAT 



-4- 2640 



Bbvl 
Dpnl | 
NlalV 



Hpyl78III 



BamHI 
BstYI 
Sau3AI 
Alwl I 



2641 



Fnu4HI 
j Alul | 

| CviJI j 

j Nlalll Tselj 
| Alwl | Mnll | j 

I I I III 

TATTCCTGATATTTTCCGTAAGGATCCCTCATGTGAAGCTGCTCTGGTGATTAGCGGAGA 

+ H- + + + + 

ATAAGGACTATAAAAGG CATTC CTAGGG AGTAC ACTTCG ACGAG AC C AC TAATCG CCTCT 



Acil 
Plel | Hinf I 
BsroAI | |HphI | 



2700 



Hpyl7 8III 
BsaAI 
Mae 1 1 



CviJI 
ScrFI | 
EcoRII I 



Af 1III 
Fnu4HI | 
Tselj | 
Mspl | | | 



Bbvl 



Nlalll 
Nspl 



Sthl32I 



SimI 
BscGI I 



CTCCTGGCTTGTTCCGGCAGCACACGTATCAAGACATGCTTTTGTAGGGAGTGGAACGGG 
2701 + + + + + + 2760 

GAGGACCGAACAAGGCCGTCGTGTGCATAGTTCTGTACGAAAACATCCCTCACCTTGCCC 
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Mnll 
Banll 
BsiHKAI 
Bspl286I 

SacI | BsmI 
Alul | j Acilj 
BseMII CviJI j j Fnu4HI j 

Msel | Ddel | j j TaqI Taul j 

II I I I I I II 
TCGGTATCACTTTAACGACTATACTGAGCTCTTATGTCGAGGAAGTATAGAATGCCGCCC 
2761 + + + + + + 2820 

AGCCATAGTGAAATTGCTGATATGACTCGAGAATACAGCTCCTTCATATCTTACGGCGGG 



CO 



Apol 

Taal Tsp509I 



Cjel 



BsrDI 



Tsp509I 
Bf al | 
BslI j 
NlaIIl| | 

II I II II 

CCATG CTAGGAATTATAATATAAACTGTGGAAG CAAATTTCGTTTTTAGAAGGTTTCCAT 

2821 + + + + + 2880 

GGTACGATCCTTAATATTATATTTGACACCTTCGTTTAAAGCAAAAATCTTCCAAAGGTA 

Alwl 
Cjel 
Msel 
Dpnl 
BstYI 
Sau3AI 
Hpyl78III 
Mspl 
BsaWI | 
BspEI | 
NlaIV| j 
Xcml Drdll | j j 
I I I I I 



Dpnl 

BspGI Sau3AI | 
ScrFI |HaeIV | | 
EcoRII | |Hin4I | j 

I I I 



Alwl 
I 



TGCCTGTGTGGTTCCGGATCTTAACTATAAATCCTGGACTATGGATCATAGGCATTGGGT 

2881 + - + + + + + 2940 

ACGGACACACCAAGGCCTAGAATTGATATTTAGGACCTGATACCTAGTATCCGTAACCCA 



Hpyl78III 
TaqI 

I 

TTCTCGAACT 

2941 + 2950 

AAGAGCTTGA 
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Figure 7 

Restriction enzyme analysis of CPN100325 (RY 62 - SEQ ID NO. 7) 



BsiEI 
Pvul 
Dpnl | 
Sau3AI | | 

BsrDI Taql | j | 

I II II 
GTGGGGGCATTGCTGGGGGAAAAGCACATTTCGATCGCATTGATAATCTTATCAGTCCAA 
+ + j. h + + 

CACCCCCGTAACGACCCCCTTTTCGTGTAAAGCTAGCGTAACTATTAGAATAGTCAGGTT 



60 



U 

CO 
cn 

CO 

m 



La. 
r 

n 



BslI 
EcoNI 
Mnll 

CjePI ScrFI 
Hpyl78III | Bsll| 
Fokl | j EcoRIl) | 

CjePI Tthlllll SfaNI || j MboII| 

I I I II I I I I I . . 
AGCAACCAAGCAAAGAAAGGTGGTGGGGTTTATCTTGAAGATGCCCTCATCCTGGAAAAG 
61 + + + + + + 12Q 

TCGTTGGTTCGTTTCTTTCCACCACCCCAAATAGAACTTCTACGGGAGTAGGACCTTTTC 

Sfcl 
Alul| 
CviJll 
Fnu4HI | 
Cjel | | 
AlwNI BsmAI |Tsel| | | Bbvl 
I f I III 
GTTATTACAGGTTCTGTCTCACAAAATAGCAGCTACAGAAAGTGGTGGGGGTATCTACGC 
121 + + + + + + 180 

CAATAATGTCCAAGACAGAGTGTTTTATCGTCGATGTCTTTCACCACCCCCATAGATGCG 



BpulOI 
Ddel 
Cjel | 



Tsp509I 
- Alul 
CviJI 
Hindll I 
ScrFI 



EcoRII | 
Alul | | 
CviJI j j 



Taql 



TAAGGATATTCAACTACAAGCTCTACCTGGAAGCTTCACAATTACCGATAATAAAGTCGA 



181 



ATTCCTATAAGTTGATGTTCGAGATGGACCTTCGAAGTGTTAATGGCTATTATTTCAGCT 



■+ 240 
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Figure 7 (continued) 



Maelll 
Tsp45I 
Alul | 

Bfal Tsp509I BsrI CviJI | 

Spel| Bfal | Acelll SfaNI TspRI | | 

M II I I II I 

AACTAGTCTTACTACTAGCACTAATTTATATGGTGGGGGCATCTATTCCAGTGGAGCTGT 

241 + + -t- + + + 

TTGATCAGAATGATGATCGTGATTAAATATACCACCCCCGTAGATAAGGTCACCTCGACA 



300 



301 



NgoGV 
NlalV 

Hpyl78III | Fokl 

I I I 
CACGCTAAC C AATATATCTGGAAC CTTTG G CATTAC AGGAAACTCTGTTATCAATAC AG C 

-t + + H 1 + 

GTGCGATTGGTTATATAGACCTTGGAAACCGTAATGTCCTTTGAGACAATAGTTATGTCG 



360 



ScrFI 
BsaJI | 

EcoRII j Btrl BsmAI 

SfaNI | | CviRI Fokl CviRl Maell | BsmBI 

I I I I I I II I 

GACATCCCAGGATGCAGATATACAAGGTGGGGGCATTTATGCAACCACGTCTCTCTCAAT 

361 + + + + + + 420 

CTGTAGGGTCCTACGTCTATATGTTCCACCCCCGTAAATACGTTGGTGCAGAGAGAGTTA 

Taqll Fnu4HI 
Bbvl | Tsel | 

II II 

AAAT CAATGTAATACAC C CATTCTATTTAGCAACAACTCTGCTG CCACTAAAAAAACATC 

421 + + + + + + 480 

TTTAGTTACATTATGTGGGTAAGATAAATCGTTGTTGAGACGACGGTGATTTTTTTGTAG 



Tsp509I 



CviJI 
Bbvl | 
Mwol | j 
MboIl| j | 
II II 



Maelll 
PstI 
CviRI 
Fnu4HI 
Sfcl 
MspAlI | 
Tsel j 
Acil I j 

I II 



Hin4I 
Hpyl78III | 
Taql | | 
I I I 



AACAACAAAGCAAATTGCTGGTGGGGCTATCTTCTCCGCTGCAGTAACTATCGAGAATAA 

481 + + + + + + 540 

TTGTTGTTTCGTTTAACGACCACCCCGATAGAAGAGGCGACGTCATTGATAGCTCTTATT 
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CviJl 
Ddel | 



Tsp509I 
Msel 
Mmel | 
BseMIl| | 
Bpll|| | 
III I 



Acil Hpyl88IX 



Sfcl 
AlwNI | 
BstAPI | 
Fnu4HI | | 

Tsel| || 
MWOI || || 
Sf cl | | | Mwolj 
II II 



541 



CTCTCAGCCCATTATTTTCTTAAATAATTCCGCAAAGTCGGAAGCAACTACAGCAGCAAC 

H I- + H (- + 

GAGAGTCGGGTAATAAAAGAATTTATTAAGGCGTTTCAGCCTTCGTTGATGTCGTCGTTG 



600 



\3 
ER 

f r= 



Bbvl 
PstI | 
CviRI | | 

I II 



Alul 
CviJI 
Mnll | 

II 



BseRI 
Alul 
CviJI 
Fnu4HI 
BsrDI | 
CviJI | CviRI | 
NgoGV| | Mwol j 
Nlaivj j Tsel | 
III 



Bbvl 
Maellll Msel 



601 



TGCAGGAAATAAAGATAGCTGTGGAGGAGCCATTGCAGCTAACTCTGTTACTTTAACAAA 

h + I- H H + 

ACGTCCTTTATTTCTATCGACACCTCCTCGGTAACGTCGATTGAGACAATGAAATTGTTT 



660 



661 



Tsp509I AlwNI 
Dral | Mnll | 

Msel | j CviRI | j BsrI 

II I I I I I 

TAAC C CTG AAATAAC CTTTAAAGGAAATTATGCAGAAACTGG AGGAG CG ATTGG CTGTAT 



Bpml 
BseRI | 
CviJI | | 



-+ 720 



ATTGGGACTTTATTGGAAATTTCCTTTAATACGTCTTTGACCTCCTCGCTAACCGACATA 



Dpnl Sthl32I CviRI 

Sau3AI | HphI CviJI BscGI Mnll | BsmAI | 



Taal 



I 



721 



TGATCTTACTAATGGCTCACCTCCCCGTAAAGTCTCTATTGCAGACAACGGTTCTGTCCT 
+ H + h ^ + 

ACTAGAATG ATTAC CGAGTGGAGGGGCATTTC AG AG ATAACGTCTGTTG CCAAGAC AGGA 



780 



EcoRV 

Acil Haell Cjel | 

Mnll | Hhal| Hin4I Clal | 

Hpyl78III Msel (Thai Hin4l||BsmAI Bpml | TaqI 

I III III I I I | | 

TTTTCAAGACAACTCTGCGTTAAATCGCGGAGGCGCTATCTATGGAGAGACTATCGATAT 



781 



AAAAGTTCTGTTGAGACGCAATTTAGCGCCTCCGCGATAGATACCTCTCTGATAGCTATA 



■+ 840 
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Figure 7 (continued) 



ScrFi 
EcoRII I 



Cjel 
Maelll | 
MboII | J 



Bed Tsp509I 
Earl Nlalll | CviRI I 

II III I I I II 

CTCCAGGACAGGTGCGACTTTCATCGGTAACTCTTCAAAACATGATGGAAGTGCAATTTG 



841 



■+ 900 



GAGGTCCTGTCCACGCTGAAAGTAGCCATTGAGAAGTTTTGTACTACCTTCACGTTAAAC 
Cvi JI Hhal Mael 1 1 

I i I 

CTGTTCAACAGCCCTAACTCTTGCGCCAAACTCCCAACTTATCTTTGAAAACAATAAGGT 



901 



*+ 960 



GACAAGTTGTCGGGATTGAGAACGCGGTTTGAGGGTTGAATAGAAACTTTTGTTATTCCA 



Tsp509I 
CviRI 
Fnu4HI 
Alul | 
Cvi JI | 
Tsel j 
II 



CviJI 



Alul Tsp509I 
CviJI Bbvl | 
Hindi I I | Acelll) j 
II III 



961 



T ACGG AAAC C AC AG C C ACT ACAAAAG C T TC C ATAAATAATTTAGG AG CTG CAATTT ATGG 

+ H + 1 H j_ 

ATGCCTTTGGTGTCGGTGATGTTTTCGAAGGTATTTATTAAATCCTCGACGTTAAATACC 

Aatll 
Maelll 
Tsp45I 
BsaHI | 

Maellj j Ddel 
Maelll | j | Alul | 

Tsp4 5I | j j CviJI | 

Bf al | III MspAlI | 

BsmAI Spel| j || | BseMII PvuII | Msel 

I ill! 
AAATAATGAGACTAGTGACGTCACTATCTCTTTATCAGCTGAGAATGGAAGTATTTTCTT 



1020 



1021 



-+ 1080 



TTTATTACTCTGATCACTGCAGTGATAGAGAAATAGTCGACTCTTACCTTCATAAAAGAA 



Eco57I 
Apol | 
Tsp5 0 9I | 
Maell | I 



Tthlllll 
PstI | 
CviRI | j 

Dral CviRI Sfcl | | j 

I I I I I I I j j 
TAAAAACAATCTATGCACAGCAACAAACAAATACTGCAGTATTGCTGGAAACGTAAAATT 
1081 + + --- - + + + + 1140 

ATTTTTGTTAGATACGTGTCGTTGTTTGTTTATGACGTCATAACGACCTTTGCATTTTAA 
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Alul 
CviJl 
Hindi I I j 
Mwol | 



I I 
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Alul 
CviJl 
Mwol | Sf aNI 
I I I 



Acll 
Maell 
Hindi 
Hpal 
Msel | 
CviRI | | 
I I I 



TACAG CAATAGAAGCTTCAG C AGGG AAAGCTATATCTTTCTATG ATGCAGTTAACGTT C C 



1141 



-+ 1200 



ATGTCGTTATCTTCGAAGTCGTCCCTTTCGATATAGAAAGATACTACGTCAATTGCAAGG 



5s=? 

CO 

fn 

i0 



5= A 



Msel 
Tsp509I 
Alul 
CviJl 
Hpyl78III | 
Muni | | 

Bce83I Tsp509I Smll j j 
I I I I I 



Rsal 
Tat I | Maell 
I I I 



AC CAAAG AAACAATTG CT C AAG AG CTAAATTAAATG AAAAAG CG ACAAGTACANGGACGT 



1201 



■+ 1260 



TGGTTTCTTTGTTAACGAGTTCTCGATTTAATTTACTTTTTCGCTGTTCATGTNCCTGCA 



Bsp24I 
CjePI 
Cjel| 
BsmFI I 



Maelll 
Tsp45I 



1261 



BsaJI 

I 

TTCTANTTTCTGGGGGACTTCACGGAAATAAATCCCTATTCCACAGAAAGTCACTTCGCC 

+ + + + + b 

AAGATNAAAGACCCCCTGAAGTGCCTTTATTTAGGGATAAGGTGTCTTTCAGTGAAGCGG 



1320 



Cjel 
CjePI 
Bsp2 4I | 
II 

CTNGGGAT 

1321 1328 

GANCCCTA 
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Figure 8 

Restriction enzyme analysis of CPW100368 (RY 63 - SEQ ID NO. 8) 

Cjel 
Nlalll 
BsiHKAI 
Bspl286I 
BseSI | 
CviRI j 
ApaLI | | 



Msel Taal 
I I 

TTACTTGATTTATTTAACTGTATTCTCTATTGGTGCACCATGCTCCTAAAGCCACATGCT 



CviJI Nlalll 
Mwol | Nspl 



-+ 60 



AATGAACTAAATAAATTGACATAAGAGATAACCACGTGGTACGAGGATTTCGGTGTACGA 



Alul 
CviJI 
Hindlll | 
Cjel | | 



Sspl 



BsaJI 
Styl 



61 



I Nlalll | 

I I I 

ATGGGAGTATTTTTGATAAAAAGCTTTTCCCCAAAGACACATGAAATATTCTTTACCTTG 



TACCCTCATAAAAACTATTTTTCGAAAAGGGGTTTCTGTGTACTTTATAAGAAATGGAAC 



■+ 120 



MboII 
CviJI | 

I I 



Fokl 
Mnll | 
CviJI | | 
Earl I 1 I 



Bbvl 



Fnu4HI 
CviJI | 
Tsel | 
I I 



Dpnl 
BstYI | 
Sau3AI | 
Fokl | | 

I I I 



121 



GCTACTTACCTCTTCGGCTTTAGTTTTCTCCCTACATCCACTAATGGCTGCTAACACGGA 



CGATGAATGGAGAAGCCGAAATCAAAAGAGGGATGTAGGTGATTACCGACGATTGTGCCT 



-+ 180 



Hpyl8 8IX 
Alwl | 
HaelV | | 
Hin4I | | 

I 



BsmI 

Fnu4HI | Btsl | 

Hhal| |BstAPl| | 

Tsel | | Mwol | | 

I I 



Eco57I 
TspRI 
BsaJl| 
Styl | 
Bbvl 



181 



TCTCTCATCATCCGATAACTATGAAAATGGTAGTAGTGGTAGCGCAGCATTCACTGCCAA 



AGAGAGTAGTAGGCTATTGATACTTTTACCATCATCACCATCGCGTCGTAAGTGACGGTT 

Hpyl88IX Fokl 
SfaNI | Hpyl78III | Bfal Bael 

I I II II 

GGAAACTTCGGATGCTTCAGGAACTACCTACACTCTCACTAGCGATGTTTCTATTACGAA 



■+ 240 



241 



-+ 300 



CCTTTGAAGCCTACGAAGTCCTTGATGGATGTGAGAGTGATCGCTACAAAGATAATGCTT 
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Tsp509I 
CviRI | 

II 



PstI 
CviRI | 
Bael | | 
Sfcl | | 
I I I 



Alul 
CviJI 



Acelll 
Tthlllll | 
CjePI | | 

Mnll Mmel | j j 

III II 



TGTATCTGCAATTACTCCTGCAGATAAAAGCTGTTTTACAAACACAGGAGGAGCATTGAG 

301 + + + + + + 360 

ACATAGACGTTAATGAGGACGTCTATTTTCGACAAAATGTTTGTGTCCTCCTCGTAACTC 



361 



Bbvl 
Haell 
Hhal | 
EC047III | j 
Cjel | | | 

III I Ml 

TTTTGTTGGAGCTGATCACTCATTGGTTCTGCAAACCATAGCGCTTACGCATGATGGTGC 
+ + + + + 

AAAACAACCTCGACTAGTGAGTAACCAAGACGTTTGGTATCGCGAATGCGTACTACCACG 



BseRI 



Dpnl 
Bell 
Sau3AI 
Alul | 
CviJI | 

I I 



Drdll 
CjePI (CviRI 



Fnu4HI 
Bed | 
Msll| | 
Nlalll | | Tsel | 
III II 



420 



Msel 
Plel 
Bpml 
BseMII 
Maelll 
Tsp4 5I 
CjePI 



Hinf I 
Tf il 
Hpyl78III | 
Ddel | | 
AceIIl| | 



CjePI 

Msel | Alul 

Tsp509I | | CviJI 

CviRI | | | Cjel Bsbl | 

II II I II II I I 

TGCAATTAACAATACCAACACAGCTCTTTCTTTCTCAGGATTCTCGTCACTCTTAATCGA 

421 + + + + + + 

ACGTTAATTGTTATGGTTGTGTCGAGAAAGAAAGAGTCCTAAGAGCAGTGAGAATTAGCT 



Hinf I 
Taql | 



480 



Alul 
CviJI 
Ddel I 



Acelll 
BseMII 
Sthl32I 
Bslll 



Fnu4HI 

Taul Maelll Mnll 

Acil| Tsp45I Mnll | 

i i i i ii ill 

CTCAGCTCCAGCAACAGGAACTTCGGGCGGCAAGGGTGCTATTTGTGTGACAAATACAGA 
481 + + + + + + 540 

GAGTCGAGGTCGTTGTCCTTGAAGCCCGCCGTTCCCACGATAAACACACTGTTTATGTCT 
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Eco57I 
TspRI 
Maelll 
Tsp45I 
BsrI | 
HphI | | 



Rsal 
I 

GGGAGGTACTGCGACTTTTACTGACAATGCCAGTGTCACCCTCCAAAAAAATACTTCAGA 



PCT/CA99/01230 



Bbvl 
Acelll | 
Hpyl88IX| 
Mnll | j 



541 



•+ 600 



CCCTCCATGACGCTGAAAATGACTGTTACGGTCACAGTGGGAGGTTTTTTTATGAAGTCT 



AlwNI 
BstAPI 
PstI 
CviRI 
BsaXI 



C3 

iw 



3=^ 



Fnu4HI 
Sfcl 
Alul | 
CviJI | 
Tsel | 
Bed | | 
I II 



Ddel 

SfaNI | Alul 

Dpnl | | CviJI 

Sau3AI | j j Fnu4HI | 

Clal| j j | Hin4l| j 

Taqlj | j j Pf 111081 Tsel j j 

III I I I III 

AAAAG ATGGAGCTG C AGTTTCTG C CTAC AGC ATCGATCTTGCTAAG ACT ACGACAGCAGC 

601 + + + + + + 660 

TTTTCTACCTCGACGTCAAAGACGGATGTCGTAGCTAGAACGATTCTGATGCTGTCGTCG 



Sfcl 
Apal 
Banll 



Mwol Sfcl 
I I 



Bspl286I 




Bmgl | 




BseSI j 




CviJI | 




Haelll | 




NgoGV | 




NlalV | 




Eco0109l| | 




NgoGV | | 




Nlaivj j 




Sau96l| j 


| Mnll 


EcoO109I | | j 


j Rsal| 


Sau96I j j | 


(Cjel | | 


Acil Ml | 
1 III 1 


j Tat I | | 
1 1 II 



Acelll 
Dpnl 

Bbvl | Faul 
Sau3Al|| Sthl32l| Sau96l|| | |CjeI || Sfcl 

Ddel ijjcjelBfal || Acil ||j | |TatI || CjePI 

I III I I II I III I I I II II 

TCTCTTAGATCAAAATACTAGCACAAAAAATGGCGGGGCCCTCTGTAGTACAGCAAACAC 

661 + + + 4- + + 720 

AGAGAATCTAGTTTTATGATCGTGTTTTTTACCGCCCCGGGAGACATCATGTCGTTTGTG 
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Figure 9 (continued) 



CjePl 
BseMll 
BseRI 

Tthlllll BstEII 
BsaJI | Maelll 
Styl |Hpyl78III Taal 
Taal | | Ddel |Tsp4 5I 

III I I I 



HphI 



Sf cl 
Mnll | 
I I 



TACAGTCCAAGGAAACTCAGGAACGGTGACCTTCTCCTCAAATACTGCTACAGATAAAGG 

721 -t- + + + + + 780 

ATGTCAGGTTCCTTTGAGTCCTTGCCACTGGAAGAGGAGTTTATGACGATGTCTATTTCC 



Maelll 
Hinfl IPlel 



Dpnl Bfal 
BstYI | Cac8I | 

Sau3AI | Alwl SfaNI | j 

III III III 

TGGGGGG ATCTACTCAAAAGAAAAGGATAGCACGCTAGATGCCAATACAGGAGTCGTTAC 

781 + + + + + + 840 

ACCCCCCTAGATGAGTTTTCTTTTCCTATCGTGCGATCTACGGTTATGTCCTCAGCAATG 



BscGI 
Tthlllll | 
CviRI | | 
Sthl3 2I | j j 
I I I I 



Hpyl8 8IX 

Banll 
BsiHKAI 
Bspl286I 
Sac I 
Alul | 
CviJI | 

h I 



BsaBI 



Bce83I 



I 



CTTCAAATCTAATACTGCAAAGACGGGGGGTGCTTGGAGCTCTGATGACAATCTTGCTCT 

841 + + + + + + 900 

GAAGTTTAGATTATGACGTTTCTGCCCCCCACGAACCTCGAGACTACTGTTAGAACGAGA 



Rsal 

Mspl Smll Seal 

BsrFI | Bsbl |TatI |Hpyl78III 
II I I I I I 



Fnu4HI 
Bpull02I 
Ddel 
CviJI | 
Mspl | j 
BsrFI | j |TseI 
II II I 



BseMII 
Mwol | 

I I 



TACCGGCAACACTCAAGTACTTTTTCAGGAAAATAAAACAACCGGCTCAGCAGCACAGGC 

901 + + + + + + 960 

ATGGCCGTTGTGAGTTCATGAAAAAGTCCTTTTATTTTGTTGGCCGAGTCGTCGTGTCCG 



Sthl32I 
Mspl | 
Neil j 

Bbvl ScrFI j Sfcl 

III I 
AAATAACCCGGAAGGTTGTGGTGGGGCAATCTGTTGTTATCTTGCTACAGCAACAGACAA 

961 + + + + + + 1020 

TTTATTGGGCCTTCCAACACCACCCCGTTAGACAACAATAGAACGATGTCGTTGTCTGTT 
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Alul 
CviJI 
BseMII 
Hpyl78III 
Hinf I 
Tf il 

CviJI Hpyl8 8IX | | | | Bfal 

Bsri | Ddel || j II Spel|CjeI 

II III 
AACTGGATTAGC CATTTCTCAGAATCAAGAAATG AG CTTCACTAGTAATACAACAACTGC 

1021 +- + + + + + 1080 

TTGACCTAATCGGTAAAGAGTCTTAGTTCTTTACTCGAAGTGATCATTATGTTGTTGACG 



Cjel 
Cjel 



Mwol 
Dpnl | 
Sau3AI | | 
I I I 



I 



Hpyl78III 
Rsal | 
TatI | j Bed 



Taql 

Fokl Cjel | 

I I I I I II 

GAATGGTGGAGCGATCTACGCTACTAAATGTACTCTGGATGGAAACACAACTCTTACCTT 

1081 +■ + + + + + 1140 

CTTACCACCTCGCTAGATGCGATGATTTACATGAGACCTACCTTTGTGTTGAGAATGGAA 



Hpyl88IX 
Dpnl | 
Sau3AI | j 

I I 



Fokl 
Alul | 
CviJI j 
Ecil | | 
Acil| !| 
II M 



AlwNI 
I 



CGATCAGAATACTGCGACAGCAGGATGTGGCGGAGCTATCTATACAGAAACTGAAGATTT 

1141 + + + + + + 1200 

GCTAGTCTTATGACGCTGTCGTCCTACACCGCCTCGATAGATATGTCTTTGACTTCTAAA 



Maelll 
Taal 
Tsp4 5I 
NgoGV 
NlalV 
BscGI | 
EC057I | j 

Eco57I III! BsaHI 
Sthl32I | till Narl 
Msel || I I I I BanI 
Aflll| || I I I I Fnu4Hl| 
Mbollj || I I I I Taulj 
Smll j j j Rsal | j j Acil | j 

ii i i i i i i mi 

TTCTCTTAAGGGAAGTACGGGAACCGTGACCTTCAGCACAAATACAGCAAAGACAGGCGG 

1201 + + + + + + 

AAGAGAATTCCCTTCATGCCCTTGGCACTGGAAGTCGTGTTTATGTCGTTTCTGTCCGCC 



1260 
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Figure 9 (continued) 



Haell 
Hhal | 
NgoGV| | 

Nlaivj | CviJI | Acelll | BspMI 

III I I I I I 

CGCCTTATATTCTAAAGGAAACAGCTCGCTGACTGGAAATACCAACCTGCTCTTTTCAGG 

1261 + + + + + + 1320 

GCGGAATATAAGATTTCCTTTGTCGAGCGACTGACCTTTATGGTTGGACGAGAAAAGTCC 



CacSI 
Alul | BsrI 
CviJI j Acelll | 



m 
Co 



s 

C3 



Sthl32I 
Tsp509I 
MboII 
Apal 
Banll 
Bspl286I 
Aval 
Bmgl 
BseSI 
CviJI 
Haelll 
NgoGV 
NlalV 
Sau96I 
BscGI 
Sau96I 
Eco57I 
Alul | 

CviJI j | | | | | | || Hpyl78III 
Sthl32I | j j | I j j j II Mnll | Acil 

I I I 

GAACAAAG CTACGGG C CCG AGTAATTCTTC AG CAAATCAAG AGGGTTG CGGTGGGGCAAT 

1321 + + + + + + 1380 

CTTGTTTCGATGCCCGGGCTCATTAAGAAGTCGTTTAGTTCTCCCAACGCCACCCCGTTA 



CviJI 
Bf al | 
Mwol | 



Dpnl 
NgoGV 
NlalV 
BamHI 
BstYI 
Sau3AI 
Hpyl78III 
HaelV 
Hin4I 
Alwl | 
Hinfl| | 
Tfilj j 
II I 



Alwl 



Clal 

TaqI CviRI 



1381 



CCTAG C CTTTATTGATTCAGG ATCCGTAAG CGATAAAACAGG ACTATCGATTG CAAAC AA 

+ H (■ 4- + + 

GGATCGGAAATAACTAAGTCCTAGGCATTCGCTATTTTGTCCTGATAGCTAACGTTTGTT 
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CviRl 
Fnu4HI 

Bbvl Tsel | 

CviJl | Bfal Mnll | j 
Tthlllll | |Spel|Cjel| j j 



Taal 



Cjel 
Cjel 
Mwol 
CjePI | 
Dpnl | | 
Sau3AI f I I 



1441 



CCAAGAAGTCAGCCTCACTAGTAATGCTGCAACAGTAAGTGGTGGTGCGATCTATGCTAC 

+ + + + + + 1500 

GGTTCTTCAGTCGGAGTGATCATTACGACGTTGTCATTCACCACCACGCTAGATACGATG 



Cj ePI 
NgoGV 
NlalV 
CviJl | 



Eco57I 
Bcefl [ 
Cjel | | 



Mnll 
Beef I I 



1501 



Rsal 

TatI | Bsrl 

II III III 

CAAATGTACTCTAACTGGAAACGGCTCCCTGACCTTTGACGGCAATACTGCTGGAACTTC 

+ + + + + + 1560 

GTTTACATGAGATTGACCTTTGCCGAGGGACTGGAAACTGCCGTTATGACGACCTTGAAG 



Maelll 
Eco57I Taal 

D P nI Rsal Tsp45I 

Sau3AI I TatI | NgoGV I 

Hpyl78III Hin4I | | AlwNI MboII Eco57I | | NlalV I 

I I I I I I I I I | | 

AGGAGGGGCGATCTATACAGAAACTGAAGATTTTACTCTTACAGGAAGTACAGGAACCGT 



1561 



1621 



+ + + + + 1620 

TCCTCCCCGCTAGATATGTCTTTGACTTCTAAAATGAGAATGTCCTTCATGTCCTTGGCA 

Haell 
Hhal 
NgoGV 
NlalV 
BsaHI 
Marl 
BanI 
Fnu4HI | 
Taul j 
Acil | | 

III. .. 

G AC CTTCAG CACAAATAC AG C AAAGACAGG CGG CG CCTTATATTCTAAAGG CAACAACT C 

+ + + + + + 1680 

CTGGAAGTCGTGTTTATGTCGTTTCTGTCCGCCGCGGAATATAAGATTTCCGTTGTTGAG 
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Sthl32I 
Tsp509I 
MboII 
Apal 



Banll 
Bspl286I 
Aval 
Bmgl 
BseSI 
CviJI 
Haelll 
NgoGV 
NlalV 
Sau96I 
BscGl| 
Sau96l| 
Eco57I 



Alul 
CviJI 



BspMI Sthl32I 



I I 



1681 



TCTGTCTGGTAATACCAACCTGCTCTTTTCAGGGAACAAAGCTACGGGCCCGAGTAATTC 

+ + + + + ^ 1740 

AGACAGACCATTATGGTTGGACGAGAAAAGTCCCTTGTTTCGATGCCCGGGCTCATTAAG 



1741 



AlwNI 
Plel 
Bcgl 
Hin4I 
Hinfl | 
Hpyl78III | j 
Smll I I I 

I I II II I |, 

TTCAGCAAATCAAGAGGGTTGCGGTGGGGCAATCCTATCGTTTCTTGAGTCAGCATCTGT 

+ + + + + + 1800 

AAGTCGTTTAGTTCTCCCAACGCCACCCCGTTAGGATAGCAAAGAACTCAGTCGTAGACA 



Hpyl78III Acil 
Mnll | Bcgl | 



Bce83I 
Rsal| 
Seal) 

SfaNI | | Hpyl78III 
Tat I j | Plel Hinfl I 



Hinfl 
Maell j 
MboII ( j 
CjePI | | | 

I II I 



Plel 
BsmAI | Cjel 



1801 



AAGTACTAAAAAAGGACTCTGGATTGAAGATAACGAAAACGTGAGTCTCTCTGGTAATAC 

+ + + + + + I860 

TTCATGATTTTTTCCTGAGACCTAACTTCTATTGCTTTTGCACTCAGAGAGACCATTATG 
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Figure 9 (continued) 



CjePl 
CviRI Taal 

I I 



Cjel 
Mwol 
CjePI 
Dpnl | 
Sau3AI | | 
Acil | | | 

I III 



CjePI 
Hinf I 
Cjel 
Plel 
Nlalll 



CviRI | 

BsiHKAI | | 

Bspl286I | | 

I I I 



TGCAACAGTAAGTGGCGGTGCGATCTATGCGACCAAGTGTGCTCTGCATGGAAACACGAC 

1861 + + + + + + 1920 

ACGTTGTCATTCACCGCCACGCTAGATACGCTGGTTCACACGAGACGTACCTTTGTGCTG 



PstI 
CviRI | 
Sfcl | j Dpnl 
Cjel | j j Sau3AI | 
Mnllj | | Hin4I | | 
Bed Mwol | j j Mwol | | j BseRI 

I II I I I I I I I 

TCTTACCTTTGATGGCAATACTG CCGAAACTGCAGGAGG AG CG ATCTATACAGAAACCGA 

1921 + + + + + + 1980 

AGAATGGAAACTACCGTTATGACGGCTTTGACGTCCTCCTCGCTAGATATGTCTTTGGCT 



Maelll 
Taal 
Tsp45I 
NgoGV 

BscGI NlalV 
Sthl32I| BscGI j 

MboII | |Eco57I | | 
Sthl32I | | j Rsal j j 



Mwol 



AGATTTTACTCTTACGGGAAGTACGGGAACCGTGACCTTCAGCACAAATACAGCAAAGAC 

1981 + + + + + + 2040 

TCTAAAATGAGAATGCCCTTCATGCCCTTGGCACTGGAAGTCGTGTTTATGTCGTTTCTG 

Banll 
Bspl286I 

CviJI | XmnI CviJI 

II I I 

AGCAGGGGCTCTACATACTAAAGGAAATACTTCCTTTACCAAAAATAAGGCTCTTGTATT 



2041 



■+ 2100 



TCGTCCCCGAGATGTATGATTTCCTTTATGAAGGAAATGGTTTTTATTCCGAGAACATAA 



Hpyl78III 

Apol Dpnl | 

Tsp509I Sau3AI | j 

Hpyl78III | Sfcl | j I Alwl 

II I II I I 

TTCTGGAAATTCAGCAACAGCAACAGCAACAACAACTACAGATCAAGAAGGTTGTGGTGG 

2101 + + + + + + 2160 

AAGACCTTTAAGTCGTTGTCGTTGTCGTTGTTGTTGATGTCTAGTTCTTCCAACACCACC 
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gure 9 (continued) 



Hpyl88ix 
AlwNI | 
BplI 



Dpnl 
Hin4I | 
Sau3Al| | 
Ml 



Hinf I 
Hpyl8 8IX | 
Ddel | j 
Mnll | | j 

I I I I 



| Alul 
| CviJI 
|BseMII | 
|PleI | f 

I III 



Msel 
Alul | 
CviJI | 
Hindlll | | 
I I I 



AGCGATCCTCTGTAATATCTCAGAGTCTGACATAGCTACAAAAAGCTTAACTCTTACTGA 



2161 



■+ 2220 



TCGCTAGGAGACATTATAGAGTCTCAGACTGTATCGATGTTTTTCGAATTGAGAATGACT 



Msel 



Msel 



Beef I 



AAATGAGAGTTTAAGTTTCATTAACAATACGGCAAAAAGAAGTGGTGGTGGTATTTATGC 



2221 



-+ 2280 



TTTACTCTCAAATTCAAAGTAATTGTTATGCCGTTTTTCTTCACCACCACCATAAATACG 



BseMII 
TspRI 
Hinfl | 
Tf il | 

Ddel Ddel BtsI | j | Bed 

I I I I II I 



Sthl3 2I Mull 



TCCTAAGTGTGTAATCTCAGGCAGTGAATCCATAAACTTTGATGGCAATACTGCTGAAAC 



2281 



-+ 2340 



AGGATTCACACATTAGAGTCCGTCACTTAGGTATTTGAAACTACCGTTATGACGACTTTG 

Avail 

BseRI Sau96I 
NspV| Alul | 

Hpyl78III Tagl| TaqI CviJI Taal BsmAI 

I II I II I 

TTCGGGAGGAGCGATTTATTCGAAAAACCTTTCGATTACAGCTAACGGTCCTGTCTCCTT 

2341 + -h + + + + 2400 

AAGCCCTCCTCGCTAAATAAGCTTTTTGGAAAGCTAATGTCGATTGCCAGGACAGAGGAA 



Hpyl78III 
Mnll | 
Tsp509I| | Mnll 

II II 



Bpml 
Haell 
Hhal 



NgoGV 
NlalV 

BsaHI | 
Narl j 

Banl | | 
Mwol | | j 

I III 



CviJI 
I 



Acil 
I 



Mnll 



TAC CAATAATTCTGG AGG CAAGGGAGG CG CCATTTATATAG CCG ATAGCGGAGAACTTTC 

2401 + + + + + + 2460 

ATGGTTATTAAGACCTCCGTTCCCTCCGCGGTAAATATATCGGCTATCGCCTCTTGAAAG 
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Ddel CviJI 



BseMII Ddel BsaXI 
NgoGV|BseMII | Alol | 
Bed Ddel NlaIV|MnlI | j Ppil| 
J I I I Mill || 
CTTAGAGGCTATTGATGGGGATATTACTTTCTCAGGGAACCGAGCGACTGAGGGAACTTC 
2461 + + + + + + 2520 

GAATCTCCGATAACTACCCCTATAATGAAAGAGTCCCTTGGCTCGCTGACTCCCTTGAAG 



Dpnl 
Sau3AX | 
Taql| | 
Alwl | | | 

I II I 



ScrFI 
BsaJI 
EcoRII 
NgoGV| 
NlalV) 
BanI | | 
MslI | | j 
II 



ScrFI 
AlwNI 
EcoRII 
Alul 
CviJI 
Fnu4HI 
Tsel 



Fnu4HI 
CviRl 
Tsel 
Cac8I 
Alul 
CviJI 
Hindi I I | 
Dpnl | | 

Sau3AI | Ddel j | 



AACTCCCAACTCGATCCATTTAGGTGCCAGGGGCAAGATCACTAAGCTTGCAGCAGCTCC 

2521 + + + + + + 2580 

TTGAGGGTTGAGCTAGGTAAATCCACGGTCCCCGTTCTAGTGATTCGAACGTCGTCGAGG 



Dpnl 
Sau3AI | 
Alwl | | 



Mnll 
SfaNI 
Alul Hpyl78III 
CviJI BslI | 

Hin4I | CviRI | | 
Bed | | Mnll j j 



Acelll 
Bbvl | 
Bbvl | j 

III III Ml 

TGGTCATACGATTTATTTTTATGATCCTATTACGATGGAAGCTCCTGCATCTGGAGGAAC 

2581 + + + + + + 2640 

ACCAGTATG CTAAATAAAAATACTAGG ATAATG CTAC CTTCGAGG ACGTAGACCTC CTTG 



BseRI Xcml 
Alul | Mnll | 

Bpml BseRI CviJI j Mnll | j 

ii ii in 

AATAGAGGAGTTAGTCATCAATCCTGTTGTCAAAGCTATTGTTCCTCCTCCCCAACCAAA 

2641 + + + + + + 2700 

TTATCTCCTCAATCAGTAGTTAGGACAACAGTTTCGATAACAAGGAGGAGGGGTTGGTTT 
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igure 9 (continued) 
Avail 

Sau96I BsmI Hpyl78III 

Bsl1 I Bce83I | SmlX | Apol 

PflMI I MboII | CviJI| I Tsp509I 

■I II II I | 

AAATGGTCCTATATAGAAGAAAAACGAATGCTCTTTGTAAGGCTCAAGAGTAAAAAATTC 
2701 + - + + + + + 276Q 

TTTACCAGGATATATCTTCTTTTTGCTTACGAGAAACATTCCGAGTTCTCATTTTTTAAG 

ECOS7I 

Hpyl88IX Apol | 

Bcefl | Fnu4HI EcoRI | 
Bbvl | J Tsel| Tsp509I I 

Ml II II 

TAAAGGTATTCTCTCAATAGGTTCTGAAGTGCTGCCGTAGAATTCATAAATATCTC 
2761 + + + + _. + 2Q16 

ATTTCCATAAGAGAGTTATCCAAGACTTCACGACGGCATCTTAAGTATTTATAGAG 
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Figure 9 

Restriction enzyme analysis of CPN100624 (RY 64 - SEQ ID NO. 9) 

Msel 
Nlalll| 
Af 1III | | Dral 
BspLUllI | | Swal 
Sspl [Nsplj Msel I 

I i n ii 

TCAAATATATGAGTTTACTAACTCTGTAATATTCAACATGTTAATAAGCATATTTAAATA 
1 + + + + + + 60 

AGTTTATATACTCAAATGATTGAGACATTATAAGTTGTACAATTATTCGTATAAATTTAT 

Hpyl78III 
Apol Bf al | 

Tsp509I Psil Xbal| | Tsp509I 
I I II! ! 

TAAATTTATAAACTTCTAGACAACAAATTGATGATTTTTTATGACAAACTCTATTTTCAT 
61 + + + + + + 12Q 

ATTTAAATATTTGAAGATCTGTTGTTTAACTACTAAAAAATACTGTTTGAGATAAAAGTA 

Hhal 
TspRI 

Fokl BsmAI BtsI | 

SimI | DrdI Ddel | BseMII | | 



ATCAAAG TTTGGATGTTTATGCGACCCATTTGTCTCAGCATTTTAT C CC ACTGCGCTATG 
^ + + + h + 

TAGTTTCAAACCTACAAATACGCTGGGTAAACAGAGTCGTAAAATAGGGTGACGCGATAC 



Hpyl78III 
Mnll | 

Hpyl78III Hpyl88IX | Bf al j 

BsmFI | Mnll | |xbal| | 

II I I I Ml 

TTGTTCCTTATCAGGAAATGAAGTCCCTAACCTCGCCTCTTGTCAGATGTCTAGAAAAGA 
181 + + + + + + 24Q 

AAC AAGG AAT AG T C C TTTA C TT CAGG G ATTGG AG C GG AG AACAGTCT AC AG AT C TTTTC T 



SUBSTITUTE SHEET (RULE 26) 



WO 00/39158 



PCT/CA99/01230 



Figure 9 (continued) 



56/96 



241 



BsmFI 
Banll 
BsaJI 
Bspl286I 
Styl 
CviJl 
Hpyl78III 
Maelll 
Hpyl88IX 
Bpml 
Alul 
CviJI 
Hindi I I 
Btrl | 
Maell| BsmAlj 
AfHII | j BsmBI | 

I I I II . . , , , , , 

CATCTCTGCTTTCCACACGTCTCCAAGCTTCCGTCTGAATGTAACTCCAGAGCCCTTGGT 



GTAGAGACGAAAGGTGTGCAGAGGTTCGAAGGCAGACTTACATTGAGGTCTCGGGAACCA 



+ 300 



MboII 
Mnll I 



Hpyl78III 
Maelll 
Msel Tsp45I 
Taqll| Hinfl | 
Mnll I I Tfil I 



ScrFI 
BsaJI | 
EcoRII 



301 



I! Mill. 

TTCCTCCTTTCGTCCCTCTAATCTTCTTAATGGATTCGGTCACGATATAACCCAGGACAT 
+ + + + + ^ 

AAGGAGGAAAGCAGGGAGATTAGAAGAATTACCTAAGCCAGTGCTATATTGGGTCCTGTA 



Tsp509I 



Tsp50 9I 



Psil 



Mnll 
Xcml 
BslI | 
Pflll08l| j 
Mnll I I I Bed 



361 



CACAATTACAGGAAACTCTATCAATTCTGTTATAGATTATAACTACCACTACGAGGATGG 
+ + + + + ^ 

GTGTTAATGTCCTTTGAGATAGTTAAGACAATATCTAATATTGATGGTGATGCTCCTACC 



Apol 
Tsp509I 
Nlalll | 
CviRI | | 
BsmI Fokl |NspI | 



Hpyl8 8IX 
I 



Msel 
Aflll| 
Smll | 



421 



AGGCATTCTTGCATGTAAAAATTTGTTCATTTCTGAAAATAAAGGAAACTTAAGTTTTGA 
+ + + + + ^ 

TCCGTAAGAACGTACATTTTTAAACAAGTAAAGACTTTTATTTCCTTTGAATTCAAAACT 
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BplI 
Hpyl78III 
Taal 
Sthl32I 



Alul 
CviJI 

I 



Sf cl 
Banll 

Hpyl78III Bspl286I 
BstXI | CviJI | 
Mnll | j RleAI J 
Taal | j Hin4l| | 
II 



TspRI 
Bpml | 



BsmI 



Ddel 



481 



AAGGAATAGCTCCCACAGTTCTGGAGGGGCTCTCTACAGTGTTCGGGAATGCTGGATTTC 
- + + + + + + 

TTCCTTATCGAGGGTGTCAAGACCTCCCCGAGAGATGTCACAAGCCCTTACGACCTAAAG 



Hpyl8 8IX 
Hinfl | 
Tfil I 



Alul 
CviJI 
Eco57I 
Mwol 
BpulOI 
CviJI | 
Fnu4HI | j 
Taul | | 
Acil| | Ddel 
I I 



541 



Hpyl78III 



TAAGAATCAGAACTACTCGTTTATTTCAAATGCGGCTTCCTTAGCTACTACTACAACTTC 

ATTCTTAGTCTTGATGAGCAAATAAAGTTTACGCCGAAGGAATCGATGATGATGTTGAAG 

Bfal 
CviRI 
Nlalll 
Nspl 

CviJI Mwol | 



600 



Alul 
CviJI 



Ddel 



601 



AGGATTTGGTGGGGCTATACATG CACTAGATAG CTATATTACAAATAACTTAGGAGAAGG 
+ + + + + + 

TCCTAAACCACCCCGATATGTACGTGATCTATCGATATAATGTTTATTGAATCCTCTTCC 



Mnll Alul 
Mnll | CviJI BseRI 

Hin4I BsmAl| j Hin4l| BseRI I 

I M I il II 

ACAATTCTTAGATAATGTCTCTAAAAATAGAGGAGGAGCTATCTATGTTGGGGTGAGTTT 

+ + + + - + + 72Q 

TGTTAAGAATCTATTACAGAGATTTTTATCTCCTCCTCGATAGATACAACCCCACTCAAA 



Tsp509I Ddel 



661 
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Hphi 



Avail 
EcoO109I 
PspSII 

sausex 

Sse8647I 
Ddel | 



Hpyl78III 



TthlllXI 
Hinfl | 
Tfil I 



721 



ATCAATCACAGACAACTTAGGTCCTATCGTTATCAAGAAAAATCAAACATTAGAAGATTC 
h ^ h + + ^ 

TAGTTAGTGTCTGTTGAATCCAGGATAGCAATAGTTCTTTTTAGTTTGTAATCTTCTAAG 



780 



CviJI 
PstI 
BseRI 

MboII CviRI 
Mnll SfaNI 
Alul | Bcefl Sfcl | 

CviJI Hin4I I BstAPI I 



Mnll | | MboII 



Mwol 



Tsp509I 
Fokl I 



781 



CAGCTTTGGAGGAGGCATCTTCTGCAGAGCCGTAAATATAGAAAGGAATTATCAAAACAT 
+ + + + + ^ 

GTCGAAACCTCCTCCGTAGAAGACGTCTCGGCATTTATATCTTTCCTTAATAGTTTTGTA 



Hin4I 
MboII 
Hinfl 
Bfal 
Avrll 



Eco57I 



Hpyl78III 
Bsp24I | 
Cjel | 
CjePI | 



BsaJI 
Cjel 
Styl 
CjePI | 
Bsp24ll j 



841 



CCAAATCAATG ATAATGCTTCAGGACAAGGGGTGGTATATTTTCTG C C C CT AGGAGTCAT 
+ + + + + + 

GGTTTAGTTACTATTACGAAGTCCTGTTCCCCACCATATAAAAGACGGGGATCCTCAGTA 



Plel 



Earl Tsp509I 



HaelV 
Hin4I 
Fokl | 
Dpnl | | 
Sau3AI [ I 



Msel 
Sf aNI | 
Acil Tsp509I | (Mnll 

I I 



901 



TATCTCTTCAAATAAAGAAATT ATAGAGATC AGCAATC ACTC CG CAT C CTCAATTAAC AC 
h ^ h + - + + 

ATAGAGAAGTTTATTTCTTTAATATCTCTAGTCGTTAGTGAGGCGTAGGAGTTAATTGTG 



960 
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Figure 9 (continued) 



sfaNl 
Hpyl78III I 

I I 



Acil 
Sthl32I 
Mspl | 
Neil j 
ScrFI Bsllj 
I I I 



Rsal 
Nlalll | 
I I 



Nlalll 
Hpyl78III | 
Mnll | | 
Ddel Real j j 

I III 



961 



AGCATCAGGAAAACTATATCCCGGTGGTGGCGGTATCATGTGTACCTCCCTTAGTCATGA 

h K 1 + (- 4- 

TCGTAGT C CTTTTG ATATAGGG CCAC CAC CGCCATAGTACACATGGAGGGAATCAGTACT 



1020 



BstZ17I 
Ecil 
Acil 



Fnu4HI 



Tsel| Ddel 



Beef I 
Bbvl 
Fnu4HI | 
Taul j 
Acil | j 



ACCI 



Msel 

I II I III 

GAACAATCCCAAAGGTCTTATCTTTAACAATAAAACGGCAGCACTTAGCGGCGGAGTATA 

1021 + + + + + + 1080 

CTTGTTAGGGTTTCCAGAATAGAAATTGTTATTTTGCCGTCGTGAATCGCCGCCTCATAT 



HaelV 
Hin4I 
Dpnl 
MboII 
Bglll 
BstYI 
Sau3AI 
BssSI | 

I I 



Acil 
Avail | 
RsrII j 
Sau96I j 
Taal j 



Eco57I 
Msel 
Vspl 



CACACGAGATCTTTCATCTTCCAAAATAACGGTCCGCACAGCATTTATTAATAACTCTGC 

1081 +- + + + + + 1140 

GTGTGCTCTAGAAAGTAGAAGGTTTTATTGCCAGGCGTGTCGTAAATAATTATTGAGACG 



Banll 
Bspl286I 
Hpyl78III CviJI | 
Mull | Hin4l| | 
II III 



Rsal Apol 
BseRI Seal Tsp509I 
BplI | Tat I | MboII | Mnll 
I I I I III 



GACTTCAGGAGGGGCTCTCATCAATCTTTCTGGTATAGGAAGTACTCCTCAAAATTTCTT 

1141 + + + + + + 1200 

CTGAAGTCCrCCCCGAGAGTAGTTAGAAAGACCATATCCTTCATGAGGAGTTTTAAAGAA 



Mnll 

Pstl| BseRI 

CviRI | | Beef I MboII | 

Sfcl | | | Msel | MboII | j 

I I II II III 

CCTCTCTGCAGACTACGGCGATATTCTATTTAACAATAATACAATCACATCTTCTTCTCC 

1201 +■ + + + + + 

GGAGAGACGTCTGATGCCGCTATAAGATAAATTGTTATTATGTTAGTGTAGAAGAAGAGG 



1260 
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Sthl32I 
Mill I 
Mspl 
Neil 
ScrFI 
BslI | 
I I 



CviRI 
Bbvl | 



Fnu4HI 
Tsel 
Sthl32I 
BstAPI | 
I I 

Mwol | 
I I 



Neil 
ScrFI 
BsaJI | 
Mspl | 
II 



Msel Msel Bfal 



TCAACCCGGATATAGAAATGCACTCTATGCTGCTCCGGGGATTAACTTAAAACTAGGAGC 

1261 + + + + + + 1320 

AGTTGGGCCTATATCTTTACGTGAGATACGACGAGGCCCCTAATTGAATTTTGATCCTCG 



Hpyl88IX 
Dpnl 
Sau3AI | 
BsaBI | | 
Hpyl78III | I 

Dpnl I I I I Dpnl 
Sau3AI | | | I | BstYI 
Sf cl || I | I I Sau3AI 
Dpnl | || | I | | HaelV | 
Sau3AI | j || | I I I Hin4I j 
Alwl || | || I j j | Alwl | | 

I III I I M I I III 

AAGACAGGGTTATAAAATTCTCTTTTATGATCCTATAGATCACGATCAGACGACAACAGA 

1321 +- + + + + + 1380 

TTCTGTCCCAATATTTTAAGAGAAAATACTAGGATATCTAGTG CTAGTCTG CTGTTGT CT 



Apol 
Tsp509I 
Psil | 

I I 



Bsbl 
Taal 
NfgoGV 



NlalV 
Ban I 
BstXI | 

Tsp509I BsaJI | j 

Sfcl Msel | HphI BccI Styl j j 

I II I I II I 

TCCTATAGTATTTAATTATGAACCCCATCACCTTGGCACCGTGTTGTTTTCCGGAATCAA 

1381 +* + + + + + 1440 

AGGATATCATAAATTAATACTTGGGGTAGTGGAACCGTGGCACAACAAAAGGCCTTAGTT 



Hinf I 
Tf il 
Hpyl78III 
Mspl | 
BsaWl| | 
BspEI | j 



1441 



CjePI 
MboII | 

Hinfl Apol | j Earl 

Tfil CjePI TspSOSI | j Hpyl78III 

II III I 
TGTAGATTCTAACGCAACAAATCCATTGAACTTCCTATCAAAATTTTCTAACTCTTCACG 
+ + + + + + 150 0 

ACATCTAAGATTGCGTTGTTTAGGTAACTTGAAGGATAGTTTTAAAAGATTGAGAAGTGC 
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Dpnl 
Sau3AI 
Sthl32I 
Bbvl 



BsiHKAI 
Bspl286I 
Cac8I 



I 

I I 



MboII 
Fnu4HI | 
Cvi JI | | 
Tselj j 
II I 



CviRI 



I 



ACTTGAAAGGGGTGTGCTCGCTATTGAAGATCGGGCTGCTATTTCTTGCAAAACCCTATC 

1501 + + + + - + + 1560 

TGAACTTTCCCCACACGAGCGATAACTTCTAGCCCGACGATAAAGAACGTTTTGGGATAG 



Mwol 
Neil 
ScrFI 
Smal 
Mspl 
Neil 
ScrFI 
Aval 
BsaJI 
CviJI 
Haelll 
Sau96I 
Sthl32I 
Bbvl 

Bsml Hpyl7 8III 

BsrI | Fnu4HI Msel | 

Bmrl | j Maell Tsel | Vspl | 

III I Mil 

GCAAACTGGGGGCATTCTACGTTTAGGAAACGCAGCATTAATCAGGACGAAAGGCCCGGG 

1561 + + + + + + 1620 

CGTTTGACCCCCGTAAGATGCAAATCCTTTGCGTCGTAATTAGTCCTGCTTTCCGGGCCC 

Dpnl 
Sau3AI 
Hpyl78III| 
Alul Mbollj 
CviOTI Apol CviRI Nrul [ | CviJI 

Sthl32I |Tsp509I Msel | Thai | j Hpyl88IX | 

II I I I II I II 

AAGCTCCATAAATTTTAATGCAATCGCGATCAATCTTCCTTCTATTTTACAATCAGAAGC 

1621 + + + + + + 1680 

TTCGAGGTATTTAAAATTACGTTAGCGCTAGTTAGAAGGAAGATAAAATGTTAGTCTTCG 



SUBSTITUTE SHEET (RUL£ 26) 



09/8689 



WO 00/39158 



PCT/CA99/01230 



Figure 9 (continued) 



62/96 



Ppnl 
NgoGV 
NlalV 

Alul Hpyl78III BamHI 
CviJI Acelll | BstYI 
BbvCI | BseMII [ Sau3AI 

BpulOI j BstXI j Alwl | 

Ddel | Mnll | j Msel | | 

I I I I I II I 
CTCAGCTCCAAAGTTCTGGATTTATCCTACATTAACAGGATCCACCTATTCTGAAGACAC 
1681 + + + + + + 

GAGTCGAGGTTTCAAGACCTAAATAGGATGTAATTGTCCTAGGTGGATAAGACTTCTGTG 



Mboll 
Hpyl88IX | 
Alwl | | 



Bbsl 



1740 



. Pa 



Co 



Mboll Eco57I 



NgoGV 
NlalV 
Avail 
Eco0109I 
PspSII 
Sau96I 
SimI 
Hpyl78IIl| 
Ddel | | 
I II 



BseMII 



TTCTTCTACTATCACTCTCTCAGGACCCTTGACTTTTCTAAACGATGAAAATGAAAACCC 

1741 + + + + + + 1800 

AAGAAGATGATAGTGAGAGAGTCCTGGGAACTGAAAAGATTTGCTACTTTTACTTTTGGG 



Q 

SSf, 



Hpyl8 8IX 
Dpnl 
Bglll 



BstYI 
Sau3AI 
Ddel | 
Alul | | 
CviJI | | 
II I 



Taql 
I 



EcoRV 
BseRI | 
Mnll 



Bcgl 
BseRI | 
I I 



Bcgl 
Taql 
Mnll | 
Mnll | j 



1801 



CTATGATAGCTTAGATCTCTCTGAACCTCGAAAGGATATCCCCCCTCCTCTACCTCCTCG 
+ + + + + + 

GATACTATCGAATCTAGAGAGACTTGGAGCTTTCCTATAGGGGGGAGGAGATGGAGGAGC 



1860 



CviRI 

Mnll | Hinfl 

Mnll | j Bcgl Tfil Ddel 

MaeIIl| j j Clal | NspV | Nlalll | 

Tsp45l| j j Taql | Taql |BcgI CviJI | | 

II I I I I I I I III 

ATGTGACTG CAAAAAAATCGATACTTCGAATCTCATTGTAGAAGC CATG AACTTAG ATG A 

1861 +■ + + + + + 1920 

TACACTGACGTTTTTTTAGCTATGAAGCTTAGAGTAACATCTTCGGTACTTGAATCTACT 
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Q 

tu 

m 
to 



Q 



BsiHKAI 
Bspl286I 



Hinf I 
EcoRV Tfil 
I I 



Bsal 
BsmAI 



Bed 



Fokl 
Pflll08I I 

I I 
I I 



Alul 
CviJI 

I 



G CACT ATGG ATAT CAGGG AATCTGGT CTC C CTATTGG ATGGAAACTACG A CTACAACAAG 



1921 



-+ 1980 



Mspl 
BsaWI 
Rsal | 
Taal | j 
Sf cl | | | 
I I II 



CGTGATACCT ATAGT CC CTTAGACCAG AGGGATAACCTAC CTTTG ATG CTGATGTTGTTC 

Sfcl 
BciVI 
Bsrl 
Hinf I 
BspGI 
Acelll 
Bbvl 

Alul Plel | 

CviOTI AccI ] 
Fnu4HI | BsaAI | 
Tsel| j SnaBI | 
Cjel | | |MaeII | j 
I II I III 

CTCTACAGTACCGGAACAGACCAATACAAACCACAGGCAGCTCTACGTAGACTGGACTCC 

1981 + + + + + + 2040 

GAGATGTCATGGCCTTGTCTGGTTATGTTTGGTGTCCGTCGAGATGCATCTGACCTGAGG 

Maelll 
Sthl32I 
Tsp45I 

Mspl 
Neil 

Acil ScrFI | Apol 

Cjel | Bsll|MaeIl| Tsp509I 

II II 

TGTAGGATACCGCCCTAACCCGGAACGTCACGGAGAATTTATTGCTAATACCTTATGGCA 

2041 + + + + + + 

ACATCCTATGGCGGGATTGGGCCTTGCAGTGCCTCTTAAATAACGATTATGGAATACCGT 

Acil 
Hinf I | 

Mwol Tfil |CjePI SfaNI Mnll Mnll 

I I I I I I I 

GTCTGCCTATAACGCTCTGTTAGGAATCCGCATCTTACCTCCACAAAACCTCAAAGAGCA 

2101 + + + + + + 2160 

CAGACGGATATTGCGAGACAATCCTTAGGCGTAGAATGGAGGTGTTTTGGAGTTTCTCGT 
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Aval 
Hinfl 
Mnll | 

CviRI | j | Msel Hpyl78III 

Sthl32I | | j Tsp509I | Nrul 

Plel| j j j CviJI | j Mnll Thai Mwol 

II I I I I I I II I 

TGACCTTGAAGCCTCTCTGCAAGGACTCGGGCTTCTAATTAACCAACATAATCGCGAGGG 

2161 + + --- + — + + + 2220 

ACTGGAACTTCGGAGAGACGTTCCTGAGCCCGAAGATTAATTGGTTGTATTAGCGCTCCC 



CjePI 
Nlalll | CviJI 
I I I 



Sthl32I 
Hpyl8 8IX 
BsmFI | 
CviJI | | 
Hgal | | 
I II 



Fnu4HI 
CviJI | 
BscGI | CviRI | 
BslI I | Tselj 



Bbvl 
BbvCI | 
BpulOI j 
Ddel 



I I 



BseMII 
Fnu4HI 
CviRI 
Tsel 
Sf cl 
BstAPI | 
Mwol | 
Mnll | j 



ACG C AAAGGCTTCCGAAAC C ATACTACG GG CTATG C AG CAACAAC CTCAG CAAAAACTGC 

2221 + + + + + + 2280 

TGCGTTTCCGAAGGCTTTGGTATGATGCCCGATACGTCGTTGTTGGAGTCGTTTTTGACG 



Bfal Maell 



Hinfl 

PstI Bbvl Tf il 

III II 
AGCACGACATAGTTTCTCTTTAGGATTCGCACAAATGTTCTCCAAAACTAGAGAACGTCA 

2281 + + + + + +. 2340 

TCGTGCTGTATCAAAGAGAAATCCTAAGCGTGTTTACAAGAGGTTTTGATCTCTTGCAGT 



Taal 
MboII | 
Hinfl TaqI 
RleAI | BseRI | 
CviRI | | Eco5 7I | | 
Rsal Mnll Plel | j |AciI | j | | | BsmAI 

I I I I II I III II I 

ATCTCCAAGTACGACTTCCTCCCACAACTACTTTGCAGGACTCCGCTTCGACAGTCTCCT 

2341 + + - + + + + 2400 

TAGAGGTT C ATG CTGAAGGAGGG TGTTGATG AAACGTC CTGAGGCGAAGCTGTC AG AGG A 



Dpnl 

Hin4I Bfal Sau3AI 

Mnll | Avrll| HphI | 

Earl | j BsmFI BsaJI | Alul | j 

Bsll| j | Sfcl [CviJI Stylj CviJI j j | Ndel 

III III II III 

CTTCAGGGACTTCATCTCTACAGGGCTATCCCTAGGTTATAGCTACGGAGATCACCATAT 

2401 + + + + + 2460 

GAAGTCCCTGAAGTAGAGATGTCCCGATAGGGATCCAATATCGATGCCTCTAGTGGTATA 
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CO 



Q 



Msel SimI CviJI Msel 

II II 
G CTTTG C C AC T AT AC AG AAAT C TT AAAAGGGT CGTC C AAAG C C TT CT TT AAT AAC C AC AC 

2461 + + + . ■+■ + + 2520 

CGAAACGGTGATATGTCTTTAGAATTTTCCCAGCAGGTTTCGGAAGAAATTATTGGTGTG 



Hpyl7 8III 
CviJI | 
Bsgl | | CjePI 

BslI | j Bfalj CviRI | 
PflMI | j Xbal | j Mnll | j 
III III I I I 



Hinf I 
Tf il 
Bfal 
Alul | 
CviJI | 
HphI | | 
III 



CjePI 
Hpyl78III 
TaqI 
Faul 
Sthl32I | 
Acil | | 

Bpml | | | 

I I II 



TTTGGTAGCCTCTCTAGACTGCACATTCTTACCAGCTAGAATCACCCGCACTCTCGAACT 

2521 + + + + + + 2580 

AAACCATCGGAGAGATCTGACGTGTAAGAATGGTCGATCTTAGTGGGCGTGAGAGCTTGA 

CviJI 
Hael 
Haelll 
StuI 
ScrFI | 
Hhal BsaJI | j 
Cjel |EcoRII | I 



CviJI 



TspRI 
BsrDI | 

I I 



I I 



BstXI 
Bsal | 
BsmAI | 
Mnll | | 
I I I 



CCAGCCCTTTATCAGTGCCATTGCTCTGCGCTGTTCCCAGGCCTCGTTCCAAGAAACTGG 

2581 + + + + + + 2640 

GGTCGGGAAATAGTCACGGTAACGAGACGCGACAAGGGTCCGGAGCAAGGTTCTTTGACC 



Bed 
Bpml 
Fokl | 
Cjel Apol | j 

Bsrl|FokI TspSOSlj j 
II I Ml 



Hin4I 
Dpnl 
Bglll | 
BstYI j 
Sau3AI j 

I I 



CviJI 
Mnll I 



AGAC C ATATAAG AAAATTC C ATCCAAAACATCCC CTTACAGATCTTT C C TCTC C C ATAGG 

2641 + + + + + + 2700 

TCTGGTATATTCTTTTAAGGTAGGTTTTGTAGGGGAATGTCTAGAAAGGAGAGGGTATCC 



BslI 
Pf 1MI 

Hpyl8 8IX Nlalll | 

I I I 

CTTCCGTTCTGAATGGAAAACTTCACATCATATCCCCATGCTATGGACTACGGAAATATC 

2701 + + + + + + 2760 

GAAGGCAAGACTTACCTTTTGAAGTGTAGTATAGGGGTACGATACCTGATGCCTTTATAG 
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Rsal 
BsaAI | 

SnaBI | Hpyl7 8III 

Maellj | CjePI | Hpyl78III CjePI 

III I I I I 

CTACGTACCTACCCTATACAGAAAAAATCCAGAAATGTTCACGACACTACTCATCAGCAA 

2761 + + + + + + 2820 

GATGCATGGATGGGATATGTCTTTTTTAGGTCTTTACAAGTGCTGTGATGAGTAGTCGTT 



Tsp509I 

Bbvl | 

BsmAI I CviRI 

BsmBI | Fnu4HI 

Sthl32l| I Alul| 

Nlalll Tthlllllj | CviJlj 

BsrDI | Bsbl BscGI [j | Tsel j 

III I M I II 

TGGAACATGGACAACACAAGCAACTCCCGTCTCCTATAATTCCGTAGCTGCAAAAATAAA 

2821 + + + + + + 

ACCTTGTACCTGTTGTGTTCGTTGAGGGCAGAGGATATTAAGGCATCGACGTTTTTATTT 



2880 



Bce83I 
CjePI | 
I I 



Maelll 
Hpyl7 8III | 
Smll | j 
I I I 



Ddel 
Bce83I | 
CjePI | | 



Smll 
Alul | 
CviJT j 
BseRI | j 

Ml 



Ace-III 



I 



AAATACTTCCCAACTTTTCTCAAGAGTAACCTTATCCTTAGATTATTCAGCTCAAGTCTC 

2881 -k + + + + + 2940 

TTTATGAAGGGTTGAAAAGAGTTCTCATTGGAATAGGAATCTAATAAGTCGAGTTCAGAG 



Mnll 
Taal 
Sfcl | 
Hindi | j 
BsmAI | j | 

I III 



2941 



BsrDI 
Hinf I 
Ddel | 
Msel Alul | j | CviRI 
BseMII | CviJT j | | Plel Msel 

I I II I I I I 

CTCGTCAACTGTAGGT CAATAC CTTAAAGCTGAG AGTCATTG CAC ATTTTAAC C ACAAAG 

+ + + + 4- + 3000 

GAGCAGTTGACATCCAGTTATGGAATTTCGACTCTCAGTAACGTGTAAAATTGGTGTTTC 



Dpnl 
BstYI 
Sau3AI 
Alwl 
TspRI | 
CviRI | 1 
Taal | | | 
I I I I 



Mmel 
MboII | 
Ddel | j 
III 



AAAACATCAAGGAATAAACAGTGCAAAATAACAGATCCCTTAGTAAATCTTCCTTCTTTG 

3001 + + + - + + + 3060 

TTTTGTAGTTCCTTATTTGTCACGTTTTATTGTCTAGGGAATCATTTAGAAGGAAGAAAC 
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Figure 9 (continued) 

Tsp509I 
Msel | 
Cvi JI | | 
NgoGV| I | 
Nlaivj I | 
I! II 

TTGG AG C CTTAATTTTAGGTAAAACTAC AAT A 

3061 -i- + + -- 3092 

AACCTCGGAATTAAAATCCATTTTGATGTTAT 
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SEQ ID NO. 10) 



Msel 
Vspl 
Tsp509I | 
Msel | I 

Taal | | | 

II II 

AAACAGTTAAATAATTAATAGACAATAATCTATTCTTATTGACTTCTTTTTTTCTTGTTT 
1 + + + + + + 60 

TTTGTCAATTTATTAATTATCTGTTATTAGATAAGAATAACTGAAGAAAAAAAGAACAAA 

Apol 
Tsp509I 

Msel NspV | 

Msel Mnll | Nlalll TaqI j 

I II III 

ATTAAAGTTGCTTCAACCTTATTGATTTAACGAGGAAACCATGACCATACTTCGAAATTT 

61 4. + + + + + 120 

TAATTTCAACGAAGTTGGAATAACTAAATTGCTCCTTTGGTACTGGTATGAAGCTTTAAA 



Fnu4HI 
Tsel | 
PstI | 
Fnu4HI | 
CviRI | 
Tsel | 
Sf cl | 
Mnll | | 
Mwolj j 
II I 

TCTTACCTGCTCGGCTTTATTCCTCGCTCTCCCTGCAGCAGCACAAGTTGTATATCTTCA 

121 + + + + + + 180 

AGAATGGACGAGCCGAAATAAGGAGCGAGAGGGACGTCGTCGTGTTCAACATATAGAAGT 



BspMI 
CviJI 

I 



Bbvl 

Bbvl | Hpyl7 8III 
MboII | Real | 
II II 



Ddel 
Alul | 
CviJI | 
Hindlll | | Tsp509I 

II I I I III I 

TGAAAGTGATGGTTATAACGGTGCTATCAATAATAAAAGCTTAGAACCTAAAATTACCTG 

181 + + + + + + 240 

ACTTTCACTACCAATATTGCCACGATAGTTATTATTTTCGAATCTTGGATTTTAATGGAC 



MslI 
Nlalll I 



Bed Psil Taal 



Btrl 
Mae I I | 
Mnll | j 

Hpyl78III | | j Msel 
Bfal [ j || Acll | 

Hpyl78III Xbal|| | || Maell j 

I III I II II 

TTATCCAGAAGGAACTTCTTACATCTTTCTAGATGACGTGAGGATTTCCAACGTTAAGCA 

241 + + + + + + 300 

AATAGGTCTTCCTTGAAGAATGTAGAAAGATCTACTGCACTCCTAAAGGTTGCAATTCGT 
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Hpyl78III 
Dpnl 
Nlalll 
Bell j 
Sau3AI j 
SfaNI | 



Mmel 



Dpnl 
Sau3AI | 
Clal| j Hinfl 
MboII Psil Taql| | Tfil 



Nlalll 



301 



TGATCAAGAAGATGCTGGGGTTTTTATAAATCGATCTGGGAATCTTTTTTTCATGGGCAA 
+ + + + + + 

ACTAGTTCTTCTACGACCCCAAAAATATTTAGCTAGACCCTTAGAAAAAAAGTACCCGTT 



CviRI 

BstAPI | BbvI 

Mw °I I BsaJI | 

Taal| | Mnll || 

Ml I || 

CCGTTGCAACTTCACTTTTCACAACCTTATGACCGAGGGTTTTGGCGCTGCCATTTCGAA 



CjePI 
Fnu4HI 
Haell 
Hhal | 
Taqll j 
Tsel j 
Mmel I I 



NspV 
TaqI 



361 



^ + + + + + 420 

GGCAACGTTGAAGTGAAAAGTGTTGGAATACTGGCTCCCAAAACCGCGACGGTAAAGCTT 



CjePI 
BsmAI | 
Thai | 
Acil | I 



CjePI Tsp509I 



Ddel 
HphI 
CjePI | 



Bce83I 
BbvCI | 
BpulOI j 
Ddel 



421 



CCGCGTTGGAGACACCACTCTCACTCTCTCTAATTTTTCTTACTTAGCGTTCACCTCAGC 
+ + + + + + 

GGCGCAACCTCTGTGGTGAGAGTGAGAGAGATTAAAAAGAATGAATCGCAAGTGGAGTCG 



Mnll 
Smll 
BseMII | 
Mnll | | 



Mnll 



HaelV 
Hin4I 



NgoGV 
NlalV 
Drdlll 



TaqI 
Dpnl | 
Sau3AI I Mnll 



481 



ACCTCTACTACCTCAAGGACAAGGAGCGATTTATAGTCTTGGTTCCGTGATGATCGAAAA 
+ + + + + + 

TGGAGATGATGGAGTTCCTGTTCCTCGCTAAATATCAGAACCAAGGCACTACTAGCTTTT 



541 



Maelll Cjel 

RleAI CjePI | 

Tsp45I Earl | Fnu4HI 

Bsp24I I Bsp24l|| Alul| 

C ^ eI I MboII BbvI I)) CviJlf Hin4I 

c 3 ep I I Cjel | AceIIl| IK xsel| Cjel I 

' 1 II M III III 

TAGTGAGGAAGTGACTTTCTGTGGGAACTACTCTTCGTGGAGTGGAGCTGCGATTTATAC 



+ + + + + + goo 

ATCACTCCTTCACTGAAAGACACCCTTGATGAGAAGCACCTCACCTCGACGCTAAATATG 
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Acil 
MspAlI 
Ddel | 
Faul I 



Ddel 
Eco57I 



Sthl32I I 



Hinf I 



Plel 



I n 



EcoRII 
SexAI 
BseMII 
Acil 
Mwol | 



| NgoGV I | 
| NlalVj j 

I I ! II I I III 

TCCCTACCTTTTAGGTTCTAAGGCGAGTCGTCCTTCAGTAAATCTCAGCGGGAACCGCTA 

601 + + + + + 660 

AGGGATGGAAAATCCAAGATTCCGCTCAGCAGG7UVGTCATTTAGAGTCGCCCTTGGCGAT 



fa 



Haell 
Hhal 
NgoGV 
NlalV 
BsaHI 
Narl 
BanI 
Fnu4HI [ 

BsmAI Taul j 

ScrFI | Acil | 

BslI | j CviJI BstXI | | 

II I I I II 
CCTGGTGTTTAGAGACAATGTGAGCCAAGTTTATGGCGGCGCCATATCTACCCACAATCT 
g61 + + + + + + 7 2o 

GGACCACAAATCTCTGTTACACTCGGTTCAAATACCGCCGCGGTATAGATGGGTGTTAGA 



£3 



Avail 
EcoO109I 
PspSII 
Sau96I 
Sse8647I 
TaqI 
Aval 



Smll 
Xhol 
Hinf I | 
Hpyl78ril 
Mnll 
RleAI 
Plel | 

I I 



Btrl 
Maell | 
Nlallll 
Hpyl78III | 
CjePI | | 
Nlalll Real j j 
I III 

CACACTCACGACTCGAGGACCTTCGTGTTTTGAAAATAATCATGCTTATCATGACGTGAA 

721 + + + + + + 780 

GTGTGAGTGCTGAGCTCCTGGAAGCACAAAACTTTTATTAGTACGAATAGTACTGCACTT 
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Alwl 
Dpnl 
Sau3AI 
Clal 
TaqI 
Dpnl 
Hin4I 
Sau3AI 
ScrFI 
EcoRII 
BsrDI Mnll 
CviJI | BseRI | 
NgoGV| | CjePI | | 
Mnll Nlaivj | BsrDI | Mil I III I I Bpml Acil 
I III II I I 
TAGTAATGGAGGAGCCATTGCCATTGCTCCTGGAGGATCGATCTCTATATCCGTGAAAAG 
781 + + + + + + 

ATCATTACCTCCTCGGTAACGGTAACGAGGACCTCCTAGCTAGAGATATAGGCACTTTTC 



840 



Hin4I 
Dpnl 
Bglll 
BstYI 
MboII 

Sau3AI j j SfaNI Fokl CjePI 

I I I 

CGGAGATCTCATCTTCAAAGGAAATACAGCATCACAAGACGGAAATACAATACACAACTC 

841 + + + + + + 900 

GCCTCTAGAGTAGAAGTTTCCTTTATGTCGTAGTGTTCTGCCTTTATGTTATGTGTTGAG 



CjePI 
Msel 
Taal 
BsiHKAI 
Bspl286I 
Hpyl78III | 
Bed Xcml | | 
Bed |CviRl| j I 



I 



Hpyl78III 
Mspl 
Bsawi 
BspEI 

BsaAI Hinfl | 

Mae I I | Tfil j 

Bpml | j Hpyl88IX | j 
III II I 



901 



CATCCATCTGCAATCTGGAGCACAGTTTAAGAACCTACGTGCTGTTTCAGAATCCGGAGT 

+ H H H + + 

GTAGGTAGACGTTAGACCTCGTGTCAAATTCTTGGATGCACGACAAAGTCTTAGGCCTCA 



960 



Bsp24I 

CjePI 

Cjel| 
Dpnl | j 

Sau3AI | j j Tsp509I 
Alwl | III CviJI Hinfl Plel | 

I I III I I II 

TTATTTCTATGATCCTATAAGCCATAGCGAGTCGCATAAAATTACAGATCTTGTAATCAA 

961 + + + + + + 1020 

AATAAAGATACTAGGATATTCGGTATCGCTCAGCGTATTTTAATGTCTAGAACATTAGTT 



Dpnl 
Cjel 
CjePI 
Bglll | 
Bsp24I j 
BstYI j 
Sau3AI I 
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Figure 10 (continued) 



Hpyl78III 
Ddei 
Alul" 
Bsp24I 

Cjel | | BseMII 
Tsp509I CjePI | j ScrFI | 

Hpyl78III Eco57I | CviJI j j EcoRII I 

I III 
TGCTCCTGAAGGAAAGGAAACTTATGAAGGAACAATTAGCTTCTCAGGACTATGCCTGGA 

1021 + + + + + + 1080 

ACGAGGACTTCCTTTCCTTTGAATACTTCCTTGTTAATCGAAGAGTCCTGATACGGACCT 



Cm 
CO 



Cjel 
CjePI 
Bsp24I 
Fokl 
Nlalll 
Hpyl78III 
Real 
Dpnl | 
Bell | | 
Sau3AI | | 



Maelll 
Tsp45I 



Mnll 



Acil 
I 

TGATCATGAAGTTTGTGCGGAAAATCTTACTTCCACAATCCTACAAGATGTCACATTAGC 
1081 + + + + + + H40 

ACTAGTACTTCAAACACGCCTTTTAGAATGAAGGTGTTAGGATGTTCTACAGTGTAATCG 



£3 



BstEII 
Maelll 
BplI | 

Ppil Bed | | 

Hin4I | Hpyl8 8IX | j j 

II I III III I 

AGGAGGAACTCTCTCTCTATCGGATGGGGTTACCTTGCAACTGCATTCTTTTAAGCAGGA 



CviRI BsmI 
Fokl I CviRI 



Msel 



1141 



•+ 1200 



TCCTCCTTGAGAGAGAGATAGCCTACCCCAATGGAACGTTGACGTAAGAAAATTCGTCCT 



Bpml 
Alul | 
CviJI j 
CacSI | j 

I I I 



Drdll 
NgoGV 
NlalV 
BsmAI 
ScrFI 
EcoRII | 
BsaXI | | 

I I I 



StM.321 
Hpyl78III 
BpulOI | 
Ddel | 
SfaNI I 



BseMII 
Aval | 
Bsglj | 

II I 



AGCAAGCTCTACGCTTACTATGTCTCCAGGAACCACTCTGCTCTGCTCAGGAGATGCTCG 

1201 + + + + + + 1260 

TCGTTCGAGATGCGAATGATACAGAGGTCCTTGGTGAGACGAGACGAGTCCTCTACGAGC 
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Hpyl78III 
CviRI 
AlwNI 



Hinf I 
Tf il 
Hpyl8 8IX | 
Fokl 1 I 



1261 



Thai 
Mnll | 
Hinfl | j 
MboII Tfil | | 

GGTTCAGAATCTGCACATCCTGATTGAAGATACCGACyVACTTTGTTCCTGTAAGGATTCG 

+ + + + + ^ 1320 

CCAAGTCTTAGACGTGTAGGACTAACTTCTATGGCTGTTGAAACAAGGACATTCCTAAGC 



Mnll 
CjePI | 



1321 



SfaNI 

BsaJI | BsmAI 
Hhal | | Fokl | 

"I II I I I | | 

CGCCGAGGACAAGGATGCTCTTGTCTCATTAGAAAAACTTAAAGTTGCCTTTGAGGCTTA 



Msel 



CviJI 
Bgll | 
Mwol 



^ + + + + + 1380 

GCGGCTCCTGTTCCTACGAGAACAGAGTAATCTTTTTGAATTTCAACGGAAACTCCGAAT 



Mnll 
Earl | 
Hpyl78III 



Hinfl 

Avail Tsp50 9I Mnll Tfil 

Sau96I CjePI |MseI | CviJI MboII I 

I i III I ii 

TTGGTCCGTCTATGACTTTCCTCAATTTAAGGAAGCCTTTACGATTCCTCTTCTTGAACT 

+ + + + + + 1440 

AACCAGGCAGATACTGAAAGGAGTTAAATTCCTTCGGAAATGCTAAGGAGAAGAACTTGA 



1381 



1441 



Bfal 
Bsal 
BsmAI 
Avrll | 
BsaJI | 
Styl| 

III I I II 

TCTAGGGCCTTCTTTTGACAGTCTTCTCCTAGGGGAGACCACTTTGGAGAGAACCCAAGT 

+ + + + + + 1500 

AGATCCCGGAAGAAAACTGTCAGAAGAGGATCCCCTCTGGTGAAACCTCTCTTGGGTTCA 



CviJI 
Haelll 
ECOO109I | 
Sau96I | 
Bfal | | 



Bbsl 
MboII Taal 



Maelll 
Tsp45I 



BslI 
Alul| 
Bsllj 
CviJI | 
BpulOI | | 
Ddel j | 
NgoGV | | | 
NlalV | j j 
Avail || || 
Sau96I j j Earl | Rsal MboII 

CACAACAGAGAATGACGCCGTTCGAGGTTTCTGGTCCCTAAGCTGGGAAGAGTACCCCCC 

isoi + + + + + + 1560 

GTGTTGTCTCTTACTGCGGCAAGCTCCAAAGACCAGGGATTCGACCCTTCTCATGGGGGG 



Beef I 
I 



Hgal 
Taql 
BsmFI | 
Mnll | | 
BsaHI | j | 
I I 
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Figure 10 (continued) 



Mnll 
Hinfi| 

Hpyl78III Dpnl Ddel Tf il j 

BslI | Sau3AI | Alwl | Taal BseMII | j 

II III! I III 

TTCTCTGGATAAAGACAGAAGGATCACACCAACTAAGAAAACTGTTTTCCTCACTTGGAA 



1561 



-+ 1620 



AAGAGACCTATTTCTGTCTTCCTAGTGTGGTTGATTCTTTTGACAAAAGGAGTGAACCTT 



Dpnl 
Sau3AI | 
Ddel | j 
Hpyl78III | j 

III II III 

TCCTGAGATCACTTCTACGCCATAATCTCTAAGTCTACACTATAATTAAGGGAATCCCCT 

1621 + + + -- + + + 1680 

AGGACTCTAGTGAAGATGCGGTATTAGAGATTCAGATGTGATATTAATTCCCTTAGGGGA 



Msel Hinfl 
Ddel Accl Tsp509I | Tfil 



CO 

in 

09 



MboII NgoGV 

NgoGV| NlalV 

Nlaivj Avail 

Avail | j EcoO109I 

EcoO109l|| Hpyl88IX | 

PspSIljj BsmFI | PspSII 
Msel Sau96l|| BsmFI | j Sau96I 

I Ml I I I I 

TTAAGAAG ATTTTGGG ACCTATCTGTATT CAG AGATAGGTCC CTCTATG C AC ACATGTTC 

1681 + + + + + + 1740 

AATTCTTCTAAAACCCTGGATAGACATAAGTCTCTATCCAGGGAGATACGTGTGTACAAG 

Hpyl78III 



BssSI 
Nlalll 
Afllll | 
BspLUllI j 
Mnll | j 
CviRI | [Nspl 

III I 



ACGAG 

1741 1745 

TGCTC 
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Figure 11 

Restriction enzyme analysis of CPN100985 (RY 66 - SEQ ID NO. 11) 

Hpyl78III 
Dpnl 
Mill I 
Sau3AI | 
Msel | | 
NlalV I 



Bpull02I 

Ddel BspMI 



Cvi JI | Bpml | CviRI 

II II I 

TTTGGAACCTTAATGATCTCTGGAGGGTGGCTTAGCAATATGATTTTACGCTTTGCAGGT 

1 + + + + + + 60 

AAACCTTGGAATTACTAGAGACCTCCCACCGAATCGTTATACTAAAATGCGAAACGTCCA 

Alul Hinfl 

Hpyl88IX CviJI Tfil Cjel 

i ill 

CAGATTTTCCAAAACTTCTATAAATGGAAATAAAGAGCTTATGGGAATCTCTCTACCAGA 

61 + + + + + + 120 

p GTCTAAAAGGTTTTGAAGATATTTACCTTTATTTCTCGAATACCCTTAGAGAGATGGTCT 

1% Bfal CviJI 

% Avrll| Cjel Haelll' 

Vl Alul BsaJlj Fokl| Mspl | 

CviJI Stylj Ddel Mmel j Tthlllll | |MnlI 

I II III I I I I 

GCTTTTTTCCAACCTAGGTTCTGCTTACTTAGATTATATCTTTCAACATCCTCCGGCCTA 

%] 121 + + + + + + 180 

= CGAAAAAAGGTTGGATCCAAGACGAATGAATCTAATATAGAAAGTTGTAGGAGGCCGGAT 

Sthl32I Alul 
:J MboII BscGI | CviJI 

H BslI I CviJI | j Sfcl | 

It II III II 

I J TGTTTGGTCAGTTTTTCTTCTTTTATTAGCCCGTCTGCTTCCTATTTTTGCTGTAGCTCC 

M 181 + + - + + + + 240 

ACAAACCAGTCAAAAAGAAGAAAATAATCGGGCAGACGAAGGATAAAAACGACATCGAGG 

Hpyl78III 
Mill I 

Alul Msel | 

BslI CviJI BplI | j 

Ddel | Hin4I | Sthl32l| j j 

II I I M I I 

CTTCTTAGGAGCAAAGCTCTTTCCCTCCCCTATTAAAATCGGGATTAGTCTCTCTTGGCT 

241 + + + + + + 300 

GAAGAATCCTCGTTTCGAGAAAGGGAGGGGATAATTTTAGCCCTAATCAGAGAGAACCGA 



Cac8I 
CviJI | 
BsmAI | j 
I I I 



Ecil Tsp509I 
Acil| Dpnl | BsaBI 

CviRI BciVI | | Sau3AI | | Nlalll | 

I III III II 

TGCAATCATCTTTCCAAAAGTCTTGGCGGATACGCAGATCACAAATTACATGGATAACAA 

301 -f- + + + + + 360 

ACGTTAGTAGAAAGGTTTTCAGAACCGCCTATGCGTCTAGTGTTTAATGTACCTATTGTT 
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361 



Dpnl 
Bell | 

Sau3AI j CviJT 

I I I 
TCTCTTTTATGTTTTACTTGTGAAGGAGATGATCATAGGCATTGTGATAGGCTTTGTTTT 
1 1 + + h 

AGAGAAAATACAAAATGAACACTTCCTCTACTAGTATCCGTAACACTATCCGAAACAAAA 



420 



Alwl 
HaelV 
Hin4I 
Dpnl 

. CviRI BstYI | 

Bbvl Fnu4HI | Sau3AI | | | Hinfl 

Bsgl| Tsel| | Mwol I I I I Tfil 

II II I I I I 

AGCATTTCCCTTTTATGCTGCACAATCGGCAGGATCTTTCATCACTAACCAACAAGGGAT 

421 + + + + + + 480 

TCGTAAAGGGAAAATACGACGTGTTAGCCGTCCTAGAAAGTAGTGATTGGTTGTTCCCTA 



Fokl Hhal Nlalll 

Mnll | Thai Hin4I Acil Mnll| 

III I I II 

TCAGGGTTTAGAGGGCGCGACATCCCTGATTTCCATTGAGCAGACCTCTCCGCATGGCAT 

481 + + + + + + 540 

AGTCCCAAATCTCCCGCGCTGTAGGGACTAAAGGTAACTCGTCTGGAGAGGCGTACCGTA 

BstEII 

Hpyl78III Maelll 
MaeIIl| Tsp4 5I 

BplI Tsp45l| Taqll HphI | Taal 

I II I I I I 

TTTATACCATTACTTCGTGACTATTATTTTTTGGTTAGTGGGTGGTCACCGTATTGTAAT 

541 + + + + + + 600 

AAATATGGTAATGAAG CACTG ATAATAAAAAACCAATCAC CCACCAGTGGCATAAC ATTA 



Dpnl 
Sau3AI 
Hpyl88IX| 
Alwl | | 
XmnI | | | 

II II 

CTCTTTGTTATTGCAAACTCTTGAAGTCATTCCGATCCATAGTTTCTTTCCTGCCGAGAT 

601 + + + + + + 660 

GAGAAACAATAACGTTTGAGAACTTCAGTAAGGCTAGGTATCAAAGAAAGGACGGCTCTA 



Hpyl7 8III 
CviRI | 

I I 
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Msel 
AflII| 

Smll |Bspl286I 
Alul| j Bmgl |Sthl32I 
CviJlj j BseSI | BslI | 
111 II II 



Hpyl78III 
Dpnl | 
Bell | j 
Sau3AI j | 
I I I 



Acelll 
Alul BsmAI 
CviJI Hpyl78III 
Cac8I | BssSI | 
II II 



GATGAGCTTAAGTGCCCCGATTTGGATTACTATGATCAAGATGTGCCAGCTCTGTCTCGT 
661 + + + + + + 

CTACTCGAATTC ACGGGGCTAAAC CTAATG ATACTAGTTCTACACGGTCG AG AC AG AG CA 



720 



m 

to 

vy 



Ddel 
Alul | 
CviJI | 
MspAlI | 



Mwol 
Alul 
CviJI 
PstI 



Fnu4HI 
CviRl| 
Mwol j 
Tsel j 
Sfcl || 
BsiHKAI | I I 



BseMII PvuII |Bspl286I | || 
I II I I II 



Hpyl8 8IX 
Msel | 
Bbvl | j 

I I I 



Ddel 



GATGACCATACAGCTGAGTGCTCCTGCAGCTTTGGCGATGTTAATGTCCGACCTATTCTT 

721 + + + + + + 780 

CTACTGG TATGTCG ACTC ACG AGGACGTCGAAAC CGCTACAATTACAGG CTGG ATAAGAA 



Taal 

Mmel | Smll 
Bce83I J j NlalV | 
Msel | | BanI j j 
III III 



Bgll 
Mwol 
Msel 
Af III j 
Mnll j 
Smll j 
Mnll | j 



BseRI 
Mnll | 

I I I 
AGGGATTATTAACCGTATGGCACCTCAAGTTCAGGTCATCTACCTCCTCTCTGCCCTTAA 

781 + + + + + + 840 

TCCCTAATAATTGGCATACCGTGGAGTTCAAGTCCAGTAGATGGAGGAGAGACGGGAATT 



SimI 
Nlalll | 
Bbsl j j 

MboII j | 

CviJI | | j 

II II 



Msel 
Tsp509I | 
Drdll | j 



841 



ScrFI 
BsaJI | 
HphI EcoRII |BslI 

I III III 
GGCTTTCATGGGTCTTCTCTTTCTCACCCTGGCGTGGTGGTTCATAATTAAGCAGATAGA 
+ + + + + + 

CCGAAAGTACCCAGAAGAGAAAGAGTGGGACCGCACCACCAAGTATTAATTCGTCTATCT 



900 
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Tthlllll 



I 



BsmFI 



Drdll 
I 



Bfal 
Avrll | 
BsaJI | 
Styl| 
Bce83I | |NlaIV 

III I 



Smll 



901 



TTATTTCACTCTTGCTTGGTTCAAAGAAGTCCCCATTATGCTCCTAGGTTCCAACCCTCA 

_ H H + h + + 

AATAAAGTGAGAACGAACCAAGTTTCTTCAGGGGTAATACGAGGATCCAAGGTTGGGAGT 



960 



CviJI 
Bfal 
Mrnel 
Avrll | 
BsaJI I 
Styl| 



Hpyl78III 
SfaNI 
Hinf I 
Hpyl78III 
Maelll | 
Tsp45I j 
Plel | | 
I II 



Mnll 

Rsal | Avrll | | MaeIII| | | | Bpml 

Seal j BsaJI I j Tsp45l| j | j Hhal Hinf I | 

TatI | j Stylj j Plel || j j j Hin4I | Tfilj 

III III I II I I I I I II 
AGTACTCTAATCCCCTAGGCTCTTATCGTGACTCTTATCTGGAGATGCGCTCACTTACGA 
96! + + + + + + 1020 

TCATGAGATTAGGGGATCCGAGAATAGCACTGAGAATAGACCTCTACGCGAGTGAATGCT 



BplI TspRI Cjel 

Ddel | Taal | Hinf I | 

Cjel | j Hhal | j Ddel Tfil j Ddel 

I I I I I I III I 

ATCTTAGCGCACTGTTTATGGATTATCTTAGGGAATCTCTCGCATATTCTTTTGTAATCT 

1021 + + + + + + 1080 

TAGAAT CG CGTGACAAATACCTAATAGAAT CCCTTAGAGAG CGTATAAGAAAACATT AGA 



f^ 3 



Hpyl78III 
Hinfl Apol | 
Tfil Tsp509I | 

I I I 

AAGAATCTATAAATTCAAGA 

1081 + + 1100 

TTCTTAGATATTTAAGTTCT 
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Figure 12 

Restriction enzyme analysis of CPN100987 (RY 67 - SEQ ID NO. 12) 
Hinf I 

Hin4I | SfaNI BsrDI 
TspRI | |Bfal| Tsp509l| 
BsrI Plel | | |Bpml| Hpyl78III Mnll || 

I I ! I I II .1 I II 
CCAGTGATAAAGACTCTAGTGATAAAGATGCTCCAGAAGGAAGCAATGAAATTGAGGGTG 

1 + + + + + + 50 

GGTCACTATTTCTGAGATCACTATTTCTACGAGGTCTTCCTTCGTTACTTTAACTCCCAC 

Hpyl78III 

Maelll Hpyl78III | Bpml 

Tsp45I Bf al | I BsaJI | 

Ddel | Bsbl Xbal | j MslI Styl j 

II I III I II 
CTTAGTGACTGCCAACACTTTTGGAACTCTAGACATCTTGATGAAGCACTCCAAGGAAGA 

61 + + + + + + 120 

GAATCACTGACGGTTGTGAAAACCTTGAGATCTGTAGAACTACTTCGTGAGGTTCCTTCT 
ScrFI 



Hinf I 

CjePI | Sthl32I 

BseRl| j Mnll | 

Mnll MboII Fokl|TfiI Hpyl78III | j 

II II I III 

TGACCTCTCCAGGTTTCTTCCTAAAAATCTTCTTGTTGAATCTCCTCATCCCGAAGAAAT 

121 -t- + + + + + 180 

ACTGGAGAGGTCCAAAGAAGGATTTTTAGAAGAACAACTTAGAGGAGTAGGGCTTCTTTA 



EcoRII 
HgiEII 
Hin41 | 
CjePI |MboII 
I I I 



Dral 

MboIl| CviJI 
Mselj Fokl| Tsp509I tflalll 

II Ml I 
CCCTTTAAAATCTTTATCTTTTACGATGAGTTGGCTACCTACAATTCATCCTTCATGGAT 
181 + + + + + + 240 

GGGAAATTTTAGAAATAGAAAATGCTACTCAACCGATGGATGTTAAGTAGGAAGTACCTA 

BsaJI 

Nlalll Styl 
BsrDI MslI |XmnI Hpyl78III Mnll|Tsp509I 

I I I I I II I 

TAC CATTG CCATGAAAGAGTTCC CTC CTGAAATC CAAGGTCAATTATTAGCGTGGTTG C C 

241 + + + + + + 300 

ATGGTAACGGTACTTTCTCAAGGGAGGACTTTAGGTTCCAGTTAATAATCGCACCAACGG 

Apol CviJI 
Tsp509I ScrFI SfaNI | 

CviJI Hpyl78III | EcoRII | Sfcl | j MslI 

I II II III I 

AGAGCCTTTAGTTCAAGAAATTCTACCCTTACTGCCTGGCATCTCTATAGCCCCACATCG 

301 + + + + + + 360 

TCTCGGAAATCAAGTTCTTTAAGATGGGAATGACGGACCGTAGAGATATCGGGGTGTAGC 
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CviJI 
MboII 
NlalV 
Hpyl8 8IX 
RleAI 
BsiHKAI 



Bspl286I 
BseSI | 
CviRI | 
ApaLI | | 
I I I 



Dpnl 
BstYI | 
Sau3AI 



Hpyl78III 
Bfal | 

Xbal | j Ddel Alwl | | 

III I I I I 

CTGTGCACCTTTCGGAGCCTTCTATCTTCTAGATATGCTAAGTAAAAAGATCCGTCCTTG 

361 + + + + + + 420 

GACACGTGG AAAG CCTCGG AAG ATAGAAGATCTATACGATTC ATTTTT CTAGG CAGG AAC 



Tsp509I 



421 



SfaNI 
Mwol 

XmnI BbvCI | 

Fokl | BpulOlj | BseMII 

MboII | (MboII CviRI Ddel j | Mnll 

I I I I I III 

TGGAATTACAGAAGAAATCTTTCTTCCTGCATCCTCAGCAAATGCTATACTTTACTATAC 

h ,. + H h + 

ACCTTAATGTCTTCTTTAGAAAGAAGGACGTAGGAGTCGTTTACGATATGAAATGATATG 



480 



AlwNI 
Avail 
ECOO109I 
PspSII 
Sau96I 
Sse8647I 



Bfal 
Avrll | 
BsaJI j 
Styl| 



481 



541 



Dpnl 
Sau3AI | Msel 

III II 
AGGTCCTGTAAAGATCGCTTTAATCAACTGCCTAGGTCTTTATTCTATTGCTAAAGAGTT 
h + + + + + 

TCCAGGACATTTCTAGCGAAATTAGTTGACGGATCCAGAAATAAGATAACGATTTCTCAA 

MboII 

Hpyl78III BsmI | Sfcl 

I III 
GAAG CACATTCTGGATAAGGTTGTG ATTGAACGAGTGAAGAATG CTCTCT C CC CTAC AG A 

h h + h + -h 

CTTCGTGTAAGACCTATTCCAACACTAACTTGCTCACTTCTTACGAGAGAGGGGATGTCT 



540 



600 



MboII 
Apol | 
Tsp509I j 

Fokl Hpyl88IX PflllOSI | j 

I I II I 

GAAACTCTTTCTTACCTACTGCCAATCTCATCCGATGAAACATTTAGAAACTACGAATTT 

601 + + + + + + 660 

CTTTGAGAAAGAATGGATGACGGTTAGAGTAGGCTACTTTGTAAATCTTTGATGCTTAAA 
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Figure 12 (continued) 

Tsp509I 

Sf aNI CviRI | Taal 

I II I 

TCTTTCTTCTTGGACTACTGATGCAGAATTACGACAGTTCGTTCATAAGCAAGGGTTAGA 

661 + + + + + + 720 

AGAAAGAAGAACCTGATGACTACGTCTTAATGCTGTCAAGCAAGTATTCGTTCCCAATCT 

Taqll 
BsaAI | 
SnaBI j 

Msel Maell| j 

I III 
GTTTTTAGGTAAAGCATTAACAAAAGAAAACGCTTCTTTTCTATGGTATTTTCTACGTAG 

721 + + . + + + + 780 

CAAAAATCCATTTCGTAATTGTTTTCTTTTGCGAAGAAAAGATACCATAAAAGATGCATC 

Fokl 

BsiEI Dral | MslI 

Taql Hin4I TaqI Msel | |NlaIII Bed | 

II I II I I II 

GTTAGATGTCGGTCGAGCATATATCGTCGAGCAGACTTTAAAAACATGGTATGACCATCC 

781 + + + + 4- + 840 

CAATCTACAGCCAGCTCGTATATAGCAGCTCGTCTGAAATTTTTGTACCATACTGGTAGG 

Faul 

Sthl32l| Nlalll 
Bfal | | Nsil | 

BsmFI Msel Acil | j | CviRI | j Ddel Hindlll 

I I I I II I I I I I 
CTATGTGGATTATTTTAAGTCCCGCCTAGAACAATGCATGAAAGTCTTAGTGAAATAAAA 
841 + - + + + + + 900 

GATACACCTAATAAAATTCAGGGCGGATCTTGTTACGTACTTTCAGAATCACTTTATTTT 

Alul Alul 
CviJI CviJI 
I I 

GCTTTATAAG TAAAGATTTAG CTTTATACAAAGTATAGAAAAATAAC ACG 

901 + -- + + + + 950 

CGAAATATTCATTTCTAAATCGAAATATGTTTCATATCTTTTTATTGTGC 
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Figure 13 

Restriction enzyme analysis of CPN100988 (ry68 - SEQ ID NO. 13) 



Maelll 

Dral | Dpnl AccI 

Maelll Mselj | Sau3AI | Nlalll | Bed Fokl 

I Ml II II I | 

CGATTT CGTTACCTTTAAAGTTACTTTTG ATCGTCATGGTAGACGGATGGACATTACTG C 



- + 60 



G CTAAAG CAATGGAAATTTCAATGAAAACTAG C AGTACCATCTG CCTACCTGTAATG ACG 



Dral 
Msel 
Alul 
CviJI 
Dpnl 
Bell 



Sau3AI 
CviJI 
Mwol | 
BsaJI | | 
Styl | | 



BsaAI 
Pmll 
Maell| 
Af 1III | I 

II 



Mwol 
Nlalll I 



Mwol 



Bf al 
Spel| 
II 



61 



TCCAAGGGCTTATGATCAGCTTTAAATAAGGACACGTGCCATGTTAGCATTTTTCGCAAC 

H + 1 + -j + 

AGGTTCCCGAATACTAGTCGAAATTTATTCCTGTGCACGGTACAATCGTAAAAAGCGTTG 



12 0 



Rsal 
Seal 
TatI | 



121 



TAGTTTCAAATCTGTTCTTTTTGAGTACTCCTACCAATCATTATTACTTATTTTGATTGT 



■ + 180 



ATCAAAGTTTAGACAAGAAAAACTCATGAGGATGGTTAGTAATAATGAATAAAACTAACA 



181 



Sthl32I 

Alul | Acil 

CviJI j Dpnl Fnu4HI 

NlalV Ddel | | Hpyl7 8III Sau3AI | Taul 

BanI | Bed Mnll | j | BslI | MboII I I Cvi JI | 

M I II I I II III | | 
TTCGGCACCTCCCATCATCTTAGCTTCCATAGTCGGGATTATGGTTGCGATCTTCCAAGC 



AAGCCGTGG AGGGTAGTAGAAT CGAAGGTATCAG CCCTAATACCAACG CTAGAAGGTT CG 



240 



Bsbl 



CviRI 



Hpyl78III 
Bfal | 
Spel| | 



CGCAACACAAATCCAAGAACAGACCTTCGCTTTTGCAGTCAAACTAGTCGTGATTTTTGG 

241 + + + + - --- + + 300 

GCGTTGTGTTTAGGTTCTTGTCTGGAAGCGAAAACGTCAGTTTGATCAGCACTAAAAACC 
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Figure 13 (continued) 

Hpyl78III 
Dpnl | 
Mnll j 
Sau3AI | | Bpull02I 

Msel | [ j Ddel BspMI Hpyl88IX 

NlalV III j CviJl| Bpml I CviRI I 

I I I I I II II II 
AACCTTAATGATCTCTGGAGGGTGGCTTAGCAATATGATTTTACGCTTTGCAGGTCAGAT 
301 - + + — - + + + + 360 

TTGGAATTACTAGAGACCTCCCACCGAATCGTTATACTAAAATGCGAAACGTCCAGTCTA 

Alul 

Alul Hinfl CviJI 
CviJI Tfil Cjel| 

II II 
TTTCCAAAACTTCTATAAATGGAAATAAAGAGCTTATGGGAATCTCTCTACCAGAGCTTT 
361 + + + + + + 4 2o 

AAAGGTTTTGAAGATATTTACCTTTATTTCTCGAATACCCTTAGAGAGATGGTCTCGAAA 

Bfal CviJI 

Avrll) cjel Haelll 

BsaJlj Fokl| Mspl | BslI 

Stylj Ddel Mmelj Tthlllll I Mnll [ 

M I II I I I I I 
TTTCCAACCTAGGTTCTGCTTACTTAGATTATATCTTTCAACATCCTCCGGCCTATGTTT 
421 + + 4* + + + 4 8 o 

AAAGGTTGGATCCAAGACGAATGAATCTAATATAGAAAGTTGTAGGAGGCCGGATACAAA 

MboII 
I 

GGTCAGTTTTTCTTCTTTTA 

481 + + 500 

CCAGTCAAAAAGAAGAAAAT 
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Figure 14: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 14; ORF: cpnl00686 

1 MVSSPILNVP LKNHASVSGK FTHREVSKLA SDLKSGAMSF VPEVLSEETI 
51 SSDLGKKQCT QGIISACCGL AML.IVLMSVY YRFGGVIASG AVLLNLLLIW 
101 AALQYLDAPL TLSGLAGIVL AMGMAVDANV LVFERIREEF LLSQSLKKSV 
151 EKGYTKAFGA I FDSNIjTTVLi ASALLFFLDT GP IKGFALTL ILGIFSSMFT 
2 01 ALFMTKFFFM LWMNKTQHTQ LHMMNKFVGI KHDFLRGCKK LWAVSGSVFL 

2 51 LGCVALGFGA WNSVLGMDFK GGYAFTFNPK EHGISDVAQM RGKWHKLQE 

3 01 AGLSSRDFRI QTFGSSEKIK IYFSDKALSY TKQIRASLLK LTIMSWRYCG 

3 51 IWRNRPRFL YGNSKRNAKF WSKVSSKLSK KMRYQATIGL LGALAI ILLY 

4 01 VSLRFEWQYA FSAVCALIHD LLATCAVLFI AHFFLKKIQI DLQAIGALMT 
451 VLGYSLNNTL IIFDRIREDR QANLFTPMHV LVNDALQKTF SRTVMTTATT 

5 01 LSVLLMLLFI GGSSVFNFAF IMTIGILLGT LSSLYIAPPL LLFMVRKENR 
551 SK 



m Possible T cell epitope: 

Jj* 427 VLFIAHFFL 

in 

EO Possible B cell epitope: 

£0 465 RIREDRQAN 



b 
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Figure 15: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 15; ORF: cpnl00696 

1 MSSNLHPVGG TGTGAAAPES VLNIVEEIAA SGSVTAGLQA ITSSPGMVNL 

51 LIGWAKTKFI QPIRESKLFQ SRACQITLLV LGILLWAGL ACMFIFHSQL 

101 GANAFWLIIP AAIGLIKLLV TSLCFDEACT SEKLMVFQKW AGVLEDQLDD 

151 GILNNSNKIF GHVKTEGNTS RATTPVLNDG RGTPVLSPLV SKIARV 



Possible T cell epitope: 
133 KLMVFQKWA 



Possible B cell epitope: 
163: VKTEGNTS RAT 
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Figure 16: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 16; ORF : cpnl00709 

1 MTIRILAEGL AFRYGSKGPN IIHDVSFSVY DGDFIGIIGP NGGGKS TLTM 

51 LILGLLTPTF GSLKTFPSHS AGKQTHSMIG WVPQHFSYDP CFPISVKDW 

101 LSGRLSQLSW HGKYKKKDFE AVDHALDLVG LSDTTTTAFA HLSGGQIQRV 

151 LIiARAlASYP EILILDEPTT NIDPDNQQRI LSILKKLNRT CTILMVTHDL 

201 HHTTNYFNKV F YMNKTLHF I GRHFDLNRPI LLSSYKNQEF SCSPH 



Possible T cell epitope: 
212 YMNKTIiHFI 



Possible B cell epitopes: 
109 SWHGKYKKKDFE 

E3 

U3 166 DEPTTNIDPDNQQR 

m 

w 



Q 

I* 
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Figure 17: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 17; ORF: cpnl00710 

1 MHKVIVFIFL TLYSLKSYGN DVIDKPHVLV SIAPYKFLVE QIAEETCFVY 
51 AIVTNHYDPH TYELPPQQIK ELRQGDLWFR IGEAFGKNLL EKPYMQQVDL 
101 SQNVSLIQGK PCCN QHTTNY DTHTWLSPKN LKVQVETIVT TLSKKYPQHA 
151 TLYQStfGEKL LLALDQLNEE ILTITSKAKQ RHILVSHGAF GYFCRDYNFS 
201 QHTIEKSSHV EPSPKDVARV FRDIEQYKIS SVILLEYSGR RSSAMLADRF 
251 HMHTVNLDPY AENVLVNLKT IATTFSSL 



Possible T cell epitope: 
125 WLSPK2TLKV 



Possible B cell epitope: 

5 5 NHYDPHTYELPPQQ IKELRQGD 
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Figure 18: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 18; ORF : cpnl00711 

1 MGPGSVLSNH SKEAGGIAIN NVIIDFSEIV PTKDNATVAP PTLKLVSRTN 
51 ADSKDKIDIT GTVTLLDPNG NLYQNSYLGE DRDITLFNID NSASGAVTAT 
101 NVTLQGNLGA KKGYLGTWNL DPNSSGSKII LKWTFDKYLR WPYIPRDNHF 
151 YINSIWGAQN SLVTVNQGIL GNMLNNARFE DPAFNNFWAS AIGSFLRKEV 
201 SRNSDSFTYH GRGYTAAVDA KPRQEFILGA AFSQVFGHAE S E YHLDNYKH 
251 KGSGHSTQAS LYAGNIFYFP AIRSRPILFQ GVATYGYMQH DTTTYYPS I E 
3 01 EKNMANWDSI AWLFDLRFSV DLKEPQPHST ARLTFYTEAE YTRIRQEKFT 
351 ELDYDPRSFS ACS YGNLAI P TGFSVDGALA WREIILYNKV SAAYLPVILR 
401 NtfPKATYEVL STKEKGNWN VLPTRNAARA EVSSQIYLGS YWTLYGTYTI 
451 DASMNTLVQM ANGGIRFVF 




irk 



Possible T cell epitope: 
312 WLFDLRFSV 

Possible B cell epitope: 
24 0 : ESEYHLDNYKHKGSGHST 



SUBSTITUTE SHEET (RULE 26) 




9 09/868987 



WO 00/39158 



PCT/CA99/01230 



89/96 



Figure 19: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 19; ORF : cpnl00877 

1 MRFSLCGFPL VFSFTLLSVF DTSLSATTIS LTPEDSFHGD SQNAERSYNV 
51 QAGDVYSLTG DVSISNVDNS ALNKACFNVT SGSVTFAGNH HGLYFNNISS 
101 GTTKEGAVLC CQDPQATARF SGFSTLSFIQ SPGDIKEQGC LYSKNALMLL 
151 NNYWRFEQN QSKTKGGAIS GANVTIVGNY DSVSFYQNAA TFGGAIHSSG 
2 01 PLQIAVNQAE IRFAQNTAKN GSGGALYSDG DIDIDQNAYV LFRENEALTT 
251 A I GKGGAVC C LPTSGSSTPV PIVTFSDNKQ LVFERNHS IM GGGAIYARKL 
301 SISSGGPTLF INNISYANSQ NLGGAIAIDT GGEISLSAEK GTITFQGNRT 
351 SLPFIxNGIHL LQNAKFLKLQ ARNGYSIEFY DPITSEADGS TQLNINGDPK 
401 NKEYTGTILF SGEKSLANDP RDFKSTI PQN VNLSAGYLVI KEGAE VTVS K 
451 FTQSPGSHLV LDLGTKXi IAS KEDIAITGLA IDIDSLSSSS TAAVT KANTA 
501 NKQISVTDSI EL I S PTGNAY EDLRMRNSQT FPLLSLEPGA GGSVTVTAGD 
551 FLPVSPHYGF QGNWKLAWTG TGNKVGEFFW DKINYKPRPE KEGNLVPNTL 
601 WGNAVDVRSL MQVQETHASS LQTDRGLWID GIGNFFHVSA SEDNIRYRHN 
651 SGGYVLSVNN EITPKHYTSM AFSQLFSRDK DYAVSNNEYR MYLGSYLYQY 
701 TTSLGNIFRY AS RNPNVNVG ILSRRFLQNP LMIFHFLCAY GHATNDMKTD 
751 YANF PMVKNS WRNNCWAIEC GGSMPLLVFE NGRL.FQGAIP FMKLQLVYAY 
801 HGDFKETTAD GRRFSNGSLT SISVPLGIRF EKLALS QDVL YDFSFSYIPD 
851 IFRKDPSCEA ALVISGDSWL VPAAHVSRHA FVGSGTGRYH FNDYTELLCR 
901 GS IECRPHAR NYNINCGSKF RF 



Possible T cell epitope: 
14 6 ALMLLNNYV 



Possible B cell epitope: 
581 DKINYKPRPEKEG 
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Figure 20: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 20; ORF : CPN100325 

1 MPSSWKRLLQ VLSHKIAATE SGGGIYAKDI QLQALPGSFT ITDNKVETSL 
51 TTSTNLYGGG IYSSGAVTLT NISGTFGITG NSVINTATSQ DADIQGGGIY 
101 ATTSLS INQC NTPILFSNNS AATKKTSTTK QIAGGAIFSA AVTIENNSQP 
151 IIFLNNSAKS EATTAATAGN KDSCGGAIAA NSVTLTNNPE ITFKGNYAET 
201 GGAIGCIDLT NGSPPRKVSI ADNGSVL.FQD NSALNRGGAI YGETIDISRT 
251 GATFIGNSSK HDGSAICCST ALTLAPNSQL I FENNKVTET TATTKAS INN 
301 LGAAIYGNNE TSDVTISLSA ENGSIFFKNN LCTATNKYCS IAGNVKFTAI 
351 EASAGKAISF YDAVNVPPKK QLLKS 



Possible T cell epitope: 
226 VLFQDNSAL 



Possible B cell epitope: 
257 NSSKHDG 
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Figure 21: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 21; ORF: CPN100368 

1 MKYSLPWLLT SSALVFSLHP LMAANTDLSS SDNYENGSSG SAAFTAKETS 

51 DASGTTYTLT SDVSITNVSA ITPADKSCFT NTGGALS FVG ADHSLVLQTI 

101 ALTHDGAAIN NTNTALSFSG FSSLLIDSAP ATGTS GGKGA I CVTNTEGGT 

151 ATFTDNASVT LQKNTSEKDG AAVSAYSIDL AKTTTAALLD QNTSTKNGGA 

2 01 LCSTANTTVQ GNSGTVTFSS NTATDKGGG I YSKEKDSTLD ANTGWTFKS 
251 NTAKTGGAWS SDDNLALTGN TQVLFQENKT TGSAAQANNP EGCGGAICCY 

3 01 LATATDKTGL AISQNQEMSF TSNTTTANGG AIYATKCTLD GNTTLTFDQN 

3 51 TATAGCGGAI YTETEDFSLK GSTGTVTFST NTAKTGGALY SKGNSSLTGN 

4 01 TNLLFSGNKA TGPSNSSANQ EGCGGAILAF IDSGSVSDKT GLSIANNQEV 
451 SLTSNAATVS GGAIYATKCT LTGNGSLTFD GNTAGTSGGA IYTETEDFTL 

5 01 TGSTGTVTFS TUTAKTGGAL YSKGNNSLSG NTNLLFSGNK ATGPSNSSAN 
551 QEGCGGAILS FLESASVSTK KGLWIEDNEN VSLSGNTATV SGGAIYATKC 

6 01 ALHGNTTLTF DGNTAETAGG AIYTETEDFT LTGSTGTVTF STNTAKTAGA 
651 LHTKGNTS FT KKTKAL.VFSGN SATATATTTT DQEGCGGAIL CNISESDIAT 
701 KS LTLTENE S LSFINNTAKR SGGGIYAPKC VISGSESINF DGNTAETSGG 
751 AIYSKNLSIT ANGPVSFTNN S GGKGGAI Y I ADSGELSLEA IDGDITFSGN 
8 01 RATEGTSTPN SIHLGARGKI TKLAAAPGHT IYFYDPITME APASGGTIEE 
851 LVINPWKAI VPPPQPKNGP I 



iQ Possible T cell epitope: 
7 WLLTSSALV 

H Possible B cell epitopes: 

C3 162 QKNTSEKDG 

U 53 8 GNKATGPSNS SAZJQEG 
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Figure 22: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 22; ORF: CPN100624 

1 MTNSIFISKF GCLCDPFVSA FYPTALCCSL SGNEVPNLAS CQMSRKDISA 
51 FHTSPSFRLN VTPEPLVSSF RPSNLLNGFG HDITQDITIT GNSINSVIDY 
101 NYHYEDGGIL ACKNLFISEN KGNLS FERNS SHSSGGALYS VRECWISKNQ 
151 NYSFISNAAS LATTTTSGFG GAIHAIJ3SYI TNNLGEGQFL DNVSKNRGGA 
201 IYVGVSLSIT DNLGPIVIKK NQTLEDSSFG GGIFCRAVNI ERNYQNIQIN 
251 DNASGQGWY FLPLGVIISS NKEIIEISNH SASSINTASG KLYPGGGGIM 

3 01 CTSLSHENNP KGL I FNNKTA AL SGG VYTRD LSSSKITVRT AF INNS ATS G 
351 GALINLSGIG STPQNFFLSA DYGDILFNNN TITSSSPQPG YRNALYAAPG 

4 01 INLKLGARQG YKILFYDPID HDQTTTDP IV FNYEPHHLGT VLFSGINVDS 
451 NATNPLNFLS KFSNSSRLER GVLAIEDRAA ISCKTLSQTG GILRLGNAAL 
501 IRTKGPGSSI NFNAIAINLP SILQSEASAP KFWIYPTLTG STYSEDTSST 
551 ITLSGPLTFL NDENENPYDS LDLSEPRKDI PPPLPPRCDC KKIDTSNLIV 
601 EAMNLDEHYG YQGIWSPYWM ETTTTTS STV PEQTNTNHRQ LYVDWTPVGY 
651 RPNPERHGEF IANTLWQSAY NALLGIRILP PQNLKEHDLE AS LQGLGLL I 
701 NQHNREGRKG FRNHTTGYAA TTSAKTAARH SFSLGFAQMF SKTRERQSPS 
751 TTS SHNYFAG LRFDSLLFRD FISTGLSLGY SYGDHHMLCH YTEILKGSSK 
801 AFFNNHTLVA SLDCTFLPAR ITRTLELQPF ISAIALRCSQ AS FQETGDH I 
851 RKFHPKHPLT DLSSPIGFRS EWKTSHHIPM IiWTTEISYVP TLYRKNPEMF 
901 TTLLISNGTW TTQATPVSYN SVAAKIKNTS QLFSRVTLSL DYSAQVSSST 
951 VGQYLKAESH CTF 



Possible T cell epitope: 
640 QLYVDWTPV 



Possible B cell epitopes: 
701 NQHNREGRKGFRNHTTG 
741 SKTRERQSPSTTSSHNY 
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Figure 23: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 23; ORF : CPN100633 

1 MTILRNFLTC SALFLALPAA AQWYLHESD GYNGAINNKS LEPKITCYPE 
51 GTSYIFLDDV RISNVKHDQE D AGVF INRSG NLFFMGNRCN FTFHNLMTEG 
101 FGAAISNRVG DTTLTLSNFS YLAFTSAPLL PQGQGAIYSL GSVMIENSEE 
151 VTFCGNYSSW SGAAIYTPYL LGSKASRPSV NLSGNRYLVF RDMVSQVYGG 
201 AISTHNLTLT TRGPSCFENN HAYHDVNSNG GAIAIAPGGS ISISVKSGDL 
251 I FKGNTAS QD GNTIHNSIHL QSGAQFKNLR AVSESGVYFY DPISHSESHK 
301 ITDLVINAPE GKETYEGTIS FSGLCLDDHE VCAENLTSTI LQDVTLAGGT 
351 LSLSDGVTLQ LHSFKQEASS TLTMSPGTTL LCSGDARVQN LHILIEDTDN 
401 FVPVRIRAED KDALVSLEKL KVAFEAYWSV YDFPQFKEAF TIPLLELLGP 
451 SFDSLLLGET TLERTQVTTE NDAVRGFWSL SWEEYPPSLD KDRRITPTKK 
501 TVFLTWNPEI TSTP 



Possible T cell epitope: 




482 



Possible B cell 



640 



WEEYPPSLD KDRRITPTKK 



QLYVDWTPV 



epitope : 
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Figure 24: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 24; ORF: cpnl0098 5 

1 MGISLPELFS NLGSAYLDYI FQHPPAYVWS VFLLLLARLL PIFAVAPFLG 

51 AKLFPSPIKI GISLSWLAII FPKVLADTQ I TNYMDNNLFY VLLVKEMIIG 

101 IVIGFVLAFP FYAAQS AGS F ITNQQGIQGL EGATSLISIE QTSPHGILYH 

151 YFVTIIFWLV GGHRIVISLL LQTLEVIPIH SFFPAEMMSL SAPXWITMIK 

2 01 MCQLCLVMTI QLSAPAAIiAM LMSDLFLGII NRMAPQVQVI YLLSALKAFM 

2 51 GLLFLTLAWW FIIKQIDYFT LAWFKEVPIM LLGSNPQVL 



Possible T cell epitope: 
83 YMDNNLFYV 



Possible B cell epitope: 
7 8 TQITUYMDNN 
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Figure 25: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 25; ORF : cpn!00987 

1 MKHSKEDDLS RFLPKNLLVE SPHPEEIPLK SLSFTMSWLP TTHPSWITIA 

51 MKEFPPEIQG QLLAWLPEPL VQEILPLLPG ISIAPHRCAP FGAFYLLDML 

101 SKKIRPCGIT EEIFLPASSA NAILYYTGPV KIALINCLGL YSIAKELKHI 

151 LDKWIERVK NALSPTEKLF LTYCQSHPMK HLETTNFLSS WTTDAELRQF 

201 VHKQGLEFLG KALTKENASF LWYFLRRLDV GRAY I VEQTL KTWYDHPYVD 

251 YFKSRLEQCM KVLVTC 




Possible T cell epitope: 
22 0 FLWYFLRRL 

Possible B cell epitope: 
1 MKHSKEDDLSR 
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Figure 26: Identification of T- and B-cell epitopes from the amino acid 
sequences SEQ ID No. 26; ORF: cpnl00988 

1 MLAFFATS FK SVLFEYSYQS LLLILIVSAP PIILASIVGI MVAIFQAATQ 
51 IQEQTFAFAV KLWIFGTLM ISGGWLSNMI LRFAGQIFQN FYKWK 




Possible T cell epitope: 
21 LLLILIVSA 

Possible B cell epitope: 
89 QNFYKWK 

%Q 
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